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“FATE” OF FIRST-LIST ASSOCIATIONS IN 
TRANSFER THEORY 


JEAN M. BARNES anp BENTON J. UNDERWOOD 


Northwestern University 


Theorists who attempt to arrive 
at an accounting of the facts of trans- 
fer and retroactive inhibition in 
verbal materials must inevitably make 
a decision concerning ‘“‘what happens” 
to first-list assoc*‘ations when a second 
list islearned. ‘The present study was 
designed to gather data which would 
make such easier. Two 
transfer paradigms were used. One 
of these, A-B, A-C, is normally asso- 
ciated with production of negative 
transfer. In the other paradigm, A- 
B, A-B’, stimuli are identical with re- 
sponses highly similar; such an ar- 
rangement normally produces positive 
transfer. 

The data from the present study 
allow an evaluation of three different 
conceptions concerning the fate of 
first-list (hereafter called List 1) asso- 
ciations during the learning of the 
second list (hereafter called List 2). 
These three positions will be exam- 
ined first as they apply to the A-B, 
A-C paradigm. 


decisions 


1. List 1 associations are unlearned or 
extinguished during the learning of the second 
list. While A-B, A-C is more comparable 
to the operations of counterconditioning than 
to extinction of conditioned responses, the 
term extinction will be used here in keeping 
with past practice. Asa theoretical position, 


the uhlearning or extinction hypothesis is 
associated with Melton and Irwin (1940) who 
used it to explain certain facts of RI. They 
suggested that at least part of RI could be 
accounted for by the loss of 
strength of List 
extinction. It might also be assumed that 
negative transfer results from the interference 
accompanying the process. A 
rather impressive amount of evidence could 
be marshalled in support of this extinction 
hypothesis. For present purposes, however, 
only an illustration of this evidence will be 
given. At various points in learning the 
second list, Briggs (1954) presented the 
stimuli common to both lists, one at a time, 
and asked S to give the first of the two re- 
sponses which came to mind. The results 
showed that during List 2 
there is a gradual decrease in frequency of 
List 1 responses, the curve being not unlike 
an extinction curve 


associative 
1 responses resulting from 


extinction 


the learning of 


\ phenomenon analo- 
gous to spontaneous recovery of the List 1 
responses was also found when retention was 
various intervals of time after 
List 2 learning 
in line with the extinction hypothesis 
ever, as Briggs points out 


measured at 
Such evidence appears quite 
How- 
a certain amount 
of ambiguity remains; the extinction curve 
does not necessarily mean that List 1 associa 
tions are extinguished in the sense that they 
His results 


List 2 associations be- 


are no longer available for recall 
that 
come stronger than List 1 associations, hence, 


may mean only 


occur more and more frequently in recall 
Such an List 2 


associations must necessarily be accompanied 


increase in frequency of 


by a decrease in frequency of List 1 associa 


tions at recall. The question still remains, 
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therefore, as to whether or not the A-B 
association is available after A-C is learned 
and this is one question the present study 
attempts to answer. 

2. The List 1 
(A-B) remains relatively independent and 
intact throughout the learning of the associa- 
tions in List 2 (A-C). This will be referred 
to as the independence hypothesis. The 
verbal stimuli are, of course, identical in the 
A-B, A-C paradigm, but there are other 
possible differentiating cues (e.g., first list 
vs. second list) which would provide a means 
of having two independent response systems 
attached to the apparent stimulus. 
Negative transfer could be said to occur in 
the process of establishing the independent 
response systems early in learning A-C. 
Retroactive inhibition could be attributed 
to loss of differentiation so that response 
competition occurs. 


system of associations in 


same 


3. A third possible conception makes use 
of mediation. Having learned A-B in List 
1, S retains this association so that List 2 
items are learned with B as the mediator; 
S learns A—B—C. Negative transfer and 
RI could occur as a consequence of confusion 
on the part of S as to which response is used 
as the mediator and which response is mediated. 


In the present study, different 
groups of Ss were stopped at various 
points in learning List 2 and were 
asked to give both List 1 and List 2 
responses to each stimulus. If (as 
degree of List 2 learning increases) 
there is an increasing inability to give 
List 1 (over and above 
normal forgetting), the extinction 
hypothesis would be favored. If 
there was no such loss either the inde- 


responses 


pendence hypothesis or the mediation 
hypothesis would be favored. Provi- 
sions were included in the design to 
choose between these latter two if it 
became For example, if 
the mediation hypothesis is tenable 
it should be found that B is given 
before C more frequently than C 
before B when S is asked to write 
down both responses. The independ- 
ence hypothesis would not predict 
such an ordering in response recall. 
Turning next to the paradigm in 
which stimuli are identical but re- 


necessary. 


sponses highly similar (A-B, A-B’), 
the present study should again pro- 
vide bases for choice among the three 
hypotheses. 


1. Because retroactive facilitation is ex 
pected with this paradigm (e.g., Young, 1955) 
it might appear than an extinction hypothesis 
is not tenable. However, by the use of 
extinction plus other factors it is possible to 
account for retroactive facilitation. It is a 
fact that more List 1 instrusions occur in the 
learning of List 2 for this paradigm than for 
the A-B, A-C paradigm. Thus, the non- 
reinforcement of these responses makes ex- 
tinction a very plausible mechanism. So, it 
could be assumed that in learning A-B’, 
A-B is extinguished. Retroactive facilitation 
could then be accounted for by mediation. 
It is known that highly similar items have 
high associative connection (Haagen, 1949). 
When S is asked to recall A-B after learning 
A-B’, even if A-B is extinguished the media- 
tion sequence in recall could be A—B’—B, 
since there has been no extinction of the 
strong association between B’ and B. How- 
ever, without the addition of still more factors 
the positive transfer in learning A—B’ cannot 
be accounted for. Since the results of the 
present procedures do not make the extinc- 
tion hypothesis a reasonable one for this 
paradigm, no attempt will be made here to 
suggest the additional factors needed to 
account for the positive transfer produced by 
this paradigm. 

2. The independence hypothesis, used in 
conjunction with a theory of response gen- 
eralization (Underwood, 1951), can be em- 
ployed to account for both positive transfer 
and retroactive facilitation in the A-B, A-B’ 
paradigm. 

3. The mediation hypothesis also provides 
an attractive alternative for this positive 
transfer paradigm. As noted above, items 
which have high similarity have high associa- 
tive connection. Thus, having learned A-B, 
the learning of A-B-B’ should be very simple 
since B and B’ are already strongly connected. 
Therefore, positive transfer should occur 
In learning List 2, if mediation occurs S will 
be given additional practice on A-B; hence, 
this association will be further strengthened 
during List 2 learning, and retroactive facilita- 
tion would be expected to occur. 


In summary, two paradigms are 
used, one associated with the produc- 
tion of negative transfer, the other 
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with production of positive transfer. 
By requiring S to recall both List 1 
and List 2 responses at various points 


in the learning of List 2, it was ex- 
pected that choice could be made 
among three conceptions (extinction, 
independence, mediation) of the fate 
of List 1 associations during the learn- 
ing of List 2. 


METHOD 


General.—Two parallel experiments were 
conducted, one using the A-B, A-C transfer 
paradigm and one using the A-B, A-B’ 
paradigm. Within each experiment there 
were four groups of Ss. All Ss learned 
a first paired-associate list (List 1) of eight 
pairs to one perfect trial. Then List 2 was 
presented for a specified number of trials, 
namely 1, 5, 19, and 20 anticipation trials 
for the four groups in each experiment. 
After the specified number of trials for a 
given group, the memory drum was stopped, 
S was provided a sheet of paper on which the 
eight stimuli were printed, and he was asked 
to write down the two responses (one from 
the first list and one from the second) which 
were associated with each stimulus. This 
written recall provided the major source of 
data. 

Lists —Eight nonsense syllables of from 
60% to 73 % association value (Glaze, 1928) 
were used as stimuli for all lists. Intra- 
stimulus similarity was low in that no conso- 
nant was repeated, and four vowels were used 
twice each. The responses were two-syllable 
adjectives taken from Haagen (1949). In 
the A-B, A-C lists there was no apparent 
interlist similarity and _ intralist 
response similarity was as low as careful 
inspection procedure will produce. In the 
A-B, A-B’ lists the responses again had low 
intralist response similarity but high interlist 
response similarity. The responses from each 
list which were paired with the same stimulus 
had similarity ratings ranging from .9 to 1.4 
by Haagen’s These high-similarity 
response pairs were as follows: insane—crazy; 
barren—fruitless; complete—entire; royal 
regal; double—twofold; afraid—scared; tran- 
quil—peaceful; spoken Three differ 
ent pairings of stimuli and responses were used 
for both experiments to avoid the possibility 
that fortuitous associations between stimulus 
and response would bias the results. An 
equal number of Ss in each group was given 
each pairing. For each experiment for each 


response 


scale. 


verbal. 


condition a given list was used as List 1 
for half the Ss and as List 2 for the other half. 
Four orders of items were used to minimize 
serial learning. 

The lists were presented on a Hull-type 
drum at a 2:2-sec. rate. The intertrial inter- 
val was 4 sec. Anticipation learning was 
used for List 1 and for List 2 to the point at 
which written recall was given (after 1, 5, 10, 
or 20 anticipation trials). A 1-min. interval 
separated the learning of the two lists. 

Subjects.—A total of 192 Ss was used. 
There were 24 Ss in each condition for each 
experiment, the conditions being the point 
at which S was asked for written recall. All 
Ss had previously served in at least 
verbal-learning experiment. However, no 
other experiment had been run in which S was 
asked to recall responses from both lists, so 
there is no reason to believe that S anticipated 
that he would be asked for such recall. It 
should be emphasized that each S was asked 
for written recall only once so that it is not 
believed that Ss made any special attempt 
to remember List 1 responses. In assigning 
Ss, a listing of 192 entries was made such that 
each of the eight conditions occurred 24 times. 
Within these limits the ordering was random 


one 


and Ss were successively assigned in order 
of their appearance at the laboratory. 

Written recall.—When the specified num- 
ber of trials on List 2 had been given, the 
memory drum was stopped and the sheet for 
written recall given to S. The stimuli were 
listed on the sheet with two blank 
under each. The Ss were first instructed to 
write down the responses from the two lists 
below the appropriate stimuli. They were 
further told to write these down as they came 
to mind, and not to attempt to recall all the 
responses from one list first and then all those 
from the other. Two minutes then 
allowed for this initial recall. Following this, 
they were instructed to go through the re- 
sponses they had written, and indicate, by 
assigning the numbers 1 and 2, from which 
list the responses had come. An additional 
2 min. were allowed for this. The Ss were 
also told to write down any additional re- 
sponses which they might think of while they 
were assigning list numbers. 


spaces 


were 


After the recall papers were collected, a 
series of questions was asked to determine: 
(a) if S had written the responses as they 
came to mind or if there was deliberate at- 
tempt to recall the adjectives from a given 
list first, (b) whether or not S had used first- 
list responses to mediate the learning of the 
second-list responses. 





JEAN M 


RESULTS 


Transfer.—The major concern in 
the present paper is with the written 
recall results. However, it will be 
worthwhile to note briefly the trans- 
fer facts and to show that random 
assignment of Ss to groups was effec- 
tive. For all eight groups combined, 
the mean number of trials to learn 
List 1 to one perfect trial was 10.36, 
with the means for groups ranging 
from 9.17 to 12.00 trials. The F was 
less than 1. Lacking a precise control 
for transfer, some indication of the 
effects can be obtained by comparing 
the learning of List 1 with that of List 
2. The total correct responses for the 
first three trials of the two lists were 
used as the response measure for the 
three groups having 5, 10, and 20 
trials on List 2. For the A-B, A-B’ 
paradigm, more correct responses were 
given on List 2 than on List 1, indicat- 
ing positive transfer. For all three 
groups combined, the mean number 
correct on the first list was 9.84, and 
for the 14.35. The ¢ was 
8.55. For A-B, A-C the mean num- 
ber correct on the first list was 9.64, 
and on the second, 8.57. This ¢ was 
1.67. This result indicates that in a 
statistical sense no negative transfer 
was present, but this is not an unusual 
finding for this type of comparison 
since learning-to-learn and warm-up 
would counteract the negative effect. 
Finally, in keeping with previous 
studies (e.g., Underwood, 1951), the 
number of overt intrusions of List 1 
responses during the learning of List 2 
was much greater for A-B, A-B’ than 
for A-B, A-C. 


Written recall. 


second, 


There were several 
alternative ways by which the written 
recall The most 
stringent method would be to count 


could be scored. 


a response as correct only if it were 


placed with the appropriate stimulus 


if correctly 
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TABLE 1 


MEAN NUMBER OF RESPONSES RECALLED 
From Eacu List ror EAcu TRANsS- 
FER PARADIGM AS SCORED BY 
Two MEtHops* 


Point of Written Recall (Trials) 
Method List bai 


Paradigm A-B, A-C 


6.67 5.38 
3.46 29 
6.96 5.58 
4.12 71 


nus 


Paradigm A-B, A-B’ 


snes 
wns a 


our 7 


sanns 
sans ss 


"In Method 1 the response is counted correct « 
identified with stimulus and list 
Method 2 it is counted correct if merely recalled 


and also correctly identified as to 
list. The most liberal method would 
be to count an adjective correct if 
recalled, regardless of whether it was 
placed with the correct stimulus and 
correctly identified as to list. The 
results for both methods are shown 
in Table 1. An inspection of this 
table will show that the differences 
produced by the two scoring methods 
are relatively small. The implication 
is that if S wrote down a response, 
he usually identified it correctly with 
stimulus and list. 

Turning first to the results for the 
A-B, A-C paradigm, the results for 
the stringent scoring method have 
been plotted in Fig. 1. As would 
be expected, as the number of learning 
trials on List 2 increases, the number 
of correct responses given from this 
list increases. Responses from the 
first or A-B list, however, show a 
gradual decline as the number of 
trials on List 2 increases. It is as if 
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(n > ~ 


MEAN CORRECT RESPONSES 











lO 20 
ON SECOND LIST 


Fic. 1. Mean number of responses cor- 
rectly recalled and identified with stimulus 
and list in the A-B, A-C paradigm. 


| 5 
TRIALS 


the A-B associations weakened 
or extinguished during the learning 
of A-C. The loss as a function of 
number of A-C trials cannot be 
attributed to a few Ss losing many 
responses since most Ss showed the 
decline. An analysis of variance of 
the four recall points for A-B gave an 
F of 8.86 (F = 4.88 for P = .01 with 
3 and 92 df). 

It was noted that Fig. 1 suggests 
that the A-B associations are extin- 
guished. It might be suggested, fur- 
thermore, that not only are these 
associations extinguished, but also 
that the response per se is not avail- 
able. This could be deduced from 
the fact that the two scoring methods 
did not differ much, implying that if 
S knew the response, he also knew 
what stimulus and what list it went 
with. 


are 


However, this conclusion can- 
not be reached with complete con- 
fidence. the in- 
structions to S stressed the recall of 


The reason is that 


the responses to the specific stimuli; 
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he was never told to put down a 
response even if he didn’t know with 
which stimulus it was paired. It is 
therefore possible that Ss did not put 
down responses unless they were 
reasonably confident of its stimulus. 

The objection might be raised that 
the decline in A-B associations seen 
in Fig. 1 is not to be attributed to the 
learning of A-C but rather represents 
simple forgetting. To make sure that 
this would not be a valid objection, 
a control group of 12 Ss was run. 
This group learned List 1 to one 
perfect trial and then rested for a 
period of time equivalent to that spent 
in learning A-C by the group given 20 
trials. This period was 13 min. 
(including the 1 min. between lists). 
The rest interval was filled with S 
working on a pyramid puzzle. After 
the rest, Ss were given a written recall 
for the responses to the stimuli in 
the list they had learned. Counting 
only those responses which were 
appropriately paired with their stim- 
uli, the mean recall was 7.75. Hence, 
it can be concluded that the decre- 
ment in recall of A-B associations as a 
function of trials on A-C cannot be 
attributed to simple forgetting, but 
must result from the learning of A-C. 

The results for the A-B, A-B’ para- 
digm as given in Table 1 are quite 
different from those of A-B, A-C. 
It may first be noted that the recall 
of List 2 responses (A-B’) is nearly 
perfect after only one anticipation 
trial and, of course, remains high 
throughout 20 trials. Indeed, by 
both scoring methods the median 
recall for all groups at all points is 
8.0 items. The first-list associations 
(A-B) show no appreciable decline. 
By the stringent scoring method the 
mean recall, even after 20 trials on 
A-B’, is nearly seven. 

Order of recall. 
ing the order of 


The facts concern- 


recall of the two 
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responses can be used as evidence 
for or against a mediation hypothesis, 
as this hypothesis was discussed in 
the introduction. If S tends to recall 
List 1 responses first, it could be 
taken as evidence in support of a 
mediation hypothesis. Contrariwise, 
it would be evidence against media- 
tion if List 2 responses tend to be 
recalled first. In this analysis data 
were not used from 12 Ss who ad- 
mitted under questioning that they 
attempted to write down all responses 
from one list first and then all from 
theother. Of these, seven had learned 
A-B, A-C, and five, A-B, A-B’. Ob- 
viously, only cases in which both 
responses were recalled could be used. 
For A-B, A-B’, there were 161, 159, 
170, and 162 pairs of responses re- 
called for 1, 5, 10, and 20 trials, 
respectively. The corresponding per- 
centages for the List 1 response being 
recalled betore the List 2 response are 
79%, 62%, 58%, and 53%. For 
A-B, A-C, the number of cases where 
both responses were recalled was 79, 
113, 109, and 95 for 1, 5, 10, and 20 
trials, respectively. The percentages 
of List 1 responses recalled first are 
81%, 49%, 40%, and 43%. Assum- 
ing chance order as 50%, it can be 
seen that (excepting the conditions 
where only one trial was given on 
List 2 and where List 1 would be 
much stronger than List 2) with A-B, 
A-C the List 1 responses tend to be re- 
called first with chance or less than 
chance frequency. For A-B, A-B’ 


first-list responses are recalled with 


greater than chance frequency al- 
though after 20 trials this frequency 
has nearly reach 50%. 

Subject reports.—Of the 96 Ss who 
learned the A-B, A-B’ paradigm, 94 
said they had used the List 1 re- 
sponses to mediate the learning of 
List 2. These Ss further reported 
that they tended to drop the use of 
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these mediators as List 2 learning 
progressed. Of those 96 Ss who had 
learned A-B, A-C, only two reported 
attempts to use the B response to 
mediate A-C learning; both Ss further 
reported that this merely confused 
them. 


DISCUSSION 


The A-B, A-C paradigm.—Three con- 
ceptions of what might “happen’’ to 
A-B during the learning of A-C were 
given in the introduction. These three 
conceptions were extinction or unlearn- 
ing, the maintenance of two independent 
S-R systems, and mediation. The pres- 
ent data give strong support to the con- 
ception of extinction and at the same 
time argue against the other two con- 
ceptions. The recall of List 1 
tions decreased progressively throughout 
the learning of A-C with nearly a 50% 
loss after 20 trials on A-C. A hyper- 
bolic equation fitted to this curve pre- 
dicts an 3.46. Thus, it 
that all items would be 
extinguished, even with an extremely 
large number of trials on A-C. This 
conforms to the fact that forgetting in 
the RI paradigm has not been shown 
to be complete with very high degrees 
of interpolated learning. Thus, while 
the present data strongly support an 
extinction hypothesis they do not indi- 
cate why all items are not, nor are likely 
not to be, extinguished. 

It seems 


assoclia- 


asymptote at 


does not seem 


reasonable to reject the 
independence hypothesis. If this hy- 
pothesis is to be retained, some mecha- 
nism will have to be added to account 
for the loss in the A-B associations over 
and above normal forgetting. If it is 
said that the A-C associations were so 
dominant that S was unable to recall the 
A-B associations, then the conception 
becomes very similar to an extinction 
hypothesis. 

The mediation hypothesis cannot be 
seriously considered in light of the evi- 
dence. If List 2 responses are learned 
through mediation of B (A—B—C) the 
A-B association should not be lost from 


S’s repertoire. If it is said that the 
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mediators drop out, then something like 
extinction must also be added to account 
Fur- 
thermore, Ss’ reporrs did not indicate 
that mediation had taken place (as was 
true of the reports for A-B, A-B’). All 
in all, the results give strong support 
to an extinction-like process; they do 
not give much support to an hypothesis 
of independence nor one of mediation. 

It was noted that if S could recall a 
response he could usually identify it 
correctly as to list. On the 
such seems 


for their loss from S's repertoire. 


surface, 
contrary to con- 
ceptions which say that some RI may 
be attributed to competition resulting 
from a lack of differentiation as to which 
response belongs in which list (e.g., 
Underwood, 1949). However, two addi- 
tional facts must be noted which are 
relevant to this matter. First, in the 
present study essentially unlimited recall 
time was given so that identification of 
with lists should be more 
accurate than when recall is paced by a 
memory rate. Sec- 
ondly, this differentiation is assumed to 
be very high when recall is given im- 
after interpolation; differen- 
tiation is assumed to decrease as a func- 
tion of time between interpolated learn- 
ing and recall (Underwood, 1949). It is 
quite possible that the high accuracy of 
identifying responses with lists would 
be lost even with the present method if, 
say, a 24-hr. interval occurred between 
interpolation and written recall. Never- 
theless, the present results would strongly 
suggest that nearly all RI could be 
accounted for by unlearning or extinc- 
tion if unlimited recall time is given 
immediately after interpolated learning. 
The lack of differentiation, leading to 
competition between responses, should 
become an additional component in RI 
only after the passage of time 
interpolated learning. Indeed, ir 
of the findings suggesting spontaneous 
recovery of extinguished associations 
(Briggs, 1954), it can be held that after 
the recovery is complete, all RI is due to 
competition 


evidence 


responses 


drum at a 2-sec. 


mediately 


since 
view 


This two-component con 
ception of RI will incorporate the major 
facts. 
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The A-B, A-B' paradigm.—The data 
for this paradigm have shown that the 
\-B associations were maintained at a 
high level the learning of 
\-B’. The very slight decline noted 
over 20 trials could be attributed to the 
extinction of the A-B responses for those 
cases in which Ss did not know the mean- 
ing of a word or words, hence, similarity 
of the responses was ineffective and the 
results were the same as an A-B, A-C 
paradigm. while accepting 
this possibility, it can be omitted from 
further consideration. 


throughout 


However, 


The question still 


remains as to whether it is or is not 
likely that the A-B were 
extinguished while S learned A-B’ even 
when effective. It 
noted in the introduction that such an 
hypothesis was a plausible one; indeed, 
looking only at the 
remains plausible. 
The possibility that A-B could be ex- 
tinguished and the present results still be 
obtained (no apparent loss in the A-B 
associations) may be handled by media 
tion in which after learning A-B’ (and 
extinguishing A-B), S recalled the B 
response because of its high associative 
connection with B’. Thus, when asked 
to recall, the associations were A to B’ 
and then B’ to B. If this took place, 
A-B could be extinguished and the pres- 
ent basic results would still be obtained. 


associations 


similarity was was 


basic results it 


However, other findings argue against 
this interpretation and tend instead to 
support mediation in which A leads to B 
and B leads to B’ at recall. 


1. If B is recalled via B’, B’ should be 
recalled before B Ihe data indicate that B 
is more often recalled first. It is true that 
after 20 trials B and B’ were recalled first 
about equally often. So, it is still possibl 
that with high degrees of A-B’ learning, some 
extinction of A-B did occur 

2. Nearly all Ss report using B as a medi- 
to learn A-B’; none reported that B’ 
was used as a mediator in the recall of B 
However, they did report that the use of B 
as a mediator tended to drop out as learning 
of A-B’ proceeded. Some extinction of A-B 
might have been a concomitant or cause of 
this drop out. But, the fact that the per 
centage of times in which A-B was recalled 
first (before A-B) never drops below 50% 


ator 
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suggests that if extinction was occurring it 
was much less in amount than that which 
occurred in A-B, A-C. 


The conclusion is, therefore, that while 
the possibility of extinction of A-B in 
the A-B’ paradigm cannot be completely 
ruled out, it does not appear as com- 
pelling an interpretation as does the 
mediation hypothesis in which A-B 
mediates B’. 

Finally, what about the hypothesis of 
independent response systems in which 
response generalization is used to account 
for positive transfer 
facilitation? 


and retroactive 
The use of response gener- 
alization to explain certain facts of trans- 
fer has been of considerable help in 
organizing the facts of transfer. One 
particular statement of this formulation 
can be briefly summarized (Underwood, 
1951). It is that when A-B 
is being learned, all responses similar to 
B likewise develop associative 
strength to A. The amount of such 
strength, developed through what has 
been called parasitic reinforcement, is 
directly related to similarity of the re- 
sponse with the B response. 


assumed 


some 


Positive 
transfer occurs, therefore, because less 
learning has to occur in the second list 
than in the first. Thus, in the present 
situation, if A-B’ develops some associa- 
tive strength as a consequence of learn 
ing A-B, positive transfer should occur 
when S is asked to learn A-B’. Retro 
facilitation occur, since 
\-B’ the A-B 
further strength- 
noted that this 
with the associa 
tions between A and B, and between A 
and B’; it does not deal in any way with 
the already 


active would 
learning of 


would be 


during the 
association 
ened. It 

hypothesis deals only 


should be 


well-established association 
between B and B’ which forms a central 
part of the mediation hypothesis. 
The evidence suggests 
the response-generalization 


that 
accounting 


present 


of certain facts of transfer may well be 
abandoned in favor of a mediation hy- 
pothesis. The mediation hypothesis will 
account for the same general facts as 


will response generalization ; furthermore, 


certain facts of the present experiment 
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difficult to reconcile with the 


response-generalization hypothesis. 


seem 


1. Almost 
mediation. 


without exception, Ss report 
While it may be possible to 
extend the theory of response generalization 
to account for this fact that Ss report media- 
tion, the mediation per se 
account for the 
elaboration 

2. It was noted that after only one an- 
ticipation trial, recall of List 2 was nearly 
pertect. 
tion 


can be used to 


results, without further 


Ihe theory of response generaliza- 
1 generalization gradient de 
creasing rather sharply as similarity decreases 
The generalized association is not assumed to 


assumes 


grow in associative strength at a rate com- 
parable at all to the directly-reinforced asso- 
ciation (A-B). To account for the extraor- 
dinarily rapid learning of A-B’ would mean 
that there is essentially 
B and B’, or to say this another way, the 
flat. In previous experiments, 
no such rapid learning of A-B’ was noted; 
learning was more rapid than for a control 
condition but nothing like that shown here 
in the written recall. Indeed, in the present 
study for the groups having 5, 10, and 20 
trials (before written recall), the mean num- 
ber of items recalled on the second anticipa- 
tion trial (comparable to the point at which 
written recall 
having only 


no gradient between 


gradient is 


was asked for in the group 
1 anticipation trial) was 5.04 
This is to be contrasted with 7.75, obtained 
on written recall at the same point. The 
reason for this discrepancy, it is believed, 
is that under the procedures of 
anticipation learning S does not have time to 
mediate all items. 


standard 


Given more time, recall 
is nearly perfect right at the start of “‘learn- 
ing’’ the This finding is not 
theory of 


second list. 
with the 
generalization 


compatible response 


The conclusion is that mediation of B’ 
A-B is a appropriate 
formulation than response generalization 


through more 
to account for positive effects in transfer 
and retention when response similarity 
is involved with identical stimuli. As 
similarity between responses decreases, 
the associative connection between them 
will likewise decrease; thus, a gradient 
like phenomenon will appear in transfer. 
With 


sponses, 


moderate similarity between re 
the associative connection be 
tween the two will be less than in the 


present case, and additional strengthen- 
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ing of the associative connection will 
be necessary before mediation 
pletely effective. Thus, positive trans- 
fer will be less than when the responses 
are highly similar. It is apparent that 
since no evidence for mediation could be 
found in A-B, A-C, the mediation hy- 
pothesis must be abandoned when the 
responses level of dis- 


is com- 


reach a certain 


similarity (or a low level of similarity). 
Itis at this point that extinction begins to 
occur with consequent negative transfer. 


SUMMARY 
The A-B, A-C and the A-B, A-B 


paradigms were studied in order to evaluate 


transter 


three conceptions concerning the fate of first 


learning a second list; 
namely: extinction of first-list 


two independent 


list associations in 
associations, 
S-R sys- 
learning of the 
by the first-list associa- 


maintenance of 
and mediation of the 
second-list 
tion 


tems, 
response 
For each paradigm, different groups 
of 24 Ss each were stopped after 1, 5, 10, or 20 
anticipation trials on List 2 and were asked 
to write down both List 1 and List 2 responses 
to the stimuli Both 
pairs of nonsense syllables and 
List 1 
one perfect trial 
For the A-B, A-C paradigm there was a 
gradual reduction in reproduction of List 1 
responses as List 2 learning ir 
creased. A condition showed that 
this reduction could not be accounted for by 


lists contained eight 
two-syllable 
was carried to 


adjectives learning 


degree ot 
control 
normal forgetting No successful mediation 
of the C 
It was concluded that of the three alternative 


response via A-B was reported 
conceptions, extinction of the List 1 responses 
It appears that 
inhibition 


was clearly to be preferred. 


nearly all retroactive measured 
immediately after interpolated learning may 
unlearning of first- 
list responses during the learning of the second 
list 

For the A-B, A-B’ paradigm, the List 1 


responses showed no appreciable loss over 


be due to extinction or 
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20 trials and List 2 was given nearly perfectly 
after one anticipation trial. Of 96 Ss, 94 
reported the use of the A-B association to 
mediate the B’ response. The rapid learning 
of List 2 is understandable by the mediation 
hypothesis, but some extinction of the A-B 
association in the A-B, A-B’ paradigm cannot 
be ruled out completely by the present data 
The A-B 

and then 

occurs via 


association be extinguished 
mediation of the List 1 

A-B’-B. However, considering 
all evidence, it seemed most probable that 
mediation occurs most frequently in the order 
\-B-B’. 


transter 


may 
response 


Finally, the evidence suggests that 
effects produced by variation in 
response similarity can be more simply ac 
counted for by mediation than by 
using response generalization 


a theory 
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STUDIES OF DISTRIBUTED PRACTICE: XIX. THE 
INFLUENCE OF INTRALIST SIMILARITY WITH 
LISTS OF LOW MEANINGFULNESS! 


BENTON J. UNDERWOOD anp RUDOLPH W. SCHULZ 


Northwestern University 


In the immediately preceding study 
by Underwood and Richardson (1958) 
the interactions between meaning- 
fulness and intertrial interval and 
between intralist similarity and inter- 
trial interval were studied. Massed 
practice was defined as 8 sec. between 
trials, distributed practice by 38 sec. 
The findings in this study, which 
form the immediate background for 
the present one, follow. 


i. With a serial list constructed of sylla- 
bles of from 93-100% association value 
(Glaze, 1928) and having low intralist simi- 
larity, distributed and, massed 
practice produced equivalent rates of learning. 
With a list of syllables of the same association 
value but with Aigh intralist similarity, 
distributed practice was markedly superior 
to massed Thus, 
indicated a_ strong interaction between 
intertrial interval and intralist similarity; 
the greater the intralist similarity the greater 
the effect of distributed practice. The 
generality of this finding, however, 
questioned by the second major finding 

2. With serial lists constructed of sylla- 
bles of from 0-20% association value and of 
low and high intralist similarity, the results 
were inconclusive. While distributed prac- 
tice resulted in somewhat better performance 
than did massed practice for both levels of 
intralist similarity, the differences were not 
significant and there evidence of 
interaction between intralist similarity and 
intertrial interval as found with the 
lists of high meaningfulness. 


practice 


practice these results 


was 


was no 


was 


These inconclusive results with the 
lists of low meaningfulness prompted 
the present which 
only lists of low meaningfulness are 


investigation in 


‘This work was done under Contract 
N7onr-45008, Project NR 154-057, between 
Northwestern University the Office of 
Naval Research. 


and 


used. Several considerations directed 
the choice of conditions. (a) In the 
previous study a total of 30 acquisi- 
tion trials was given under all condi- 
tions. The level of learning for the 
high-meaningful lists was consider- 
ably higher than for the lists of low 
meaningfulness. It is possible that 
with additional trials the expected 
interaction between intertrial interval 
and intralist similarity would emerge 
for the lists of low meaningfulness 
as it did for the lists of high meaning- 
fulness. Therefore, 40 acquisition trials 
were given in the present experiment. 
(b) The effect on learning of a given 
difference in formal intralist similarity 
seems to be greater with lists of low 
meaningfulness than for lists of high 
meaningfulness. Why this may be 
true was discussed in the previous 
paper, but just what implication this 
has for a possible interaction between 
intertrial interval and intralist simi- 
larity for lists of low meaningfulness 
is not clear. In any event, it seemed 
necessary to make sure that a broad 
range of intralist similarity was used. 
Consequently, the range of formal 
intralist similarity has been extended 
in the present study. (c) Two of the 
four lists used are the same as those 
used in the previous study. In order 
to explore more fully the effect of 
length of intertrial interval, the two 
intervals used here are 2 sec. and 17 
sec. (as opposed to 8 sec. and 38 sec. 
in the earlier study). The learning 
of two lists, therefore, can be evalu- 
ated as a function of four different 
intertrial intervals. 
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METHOD 


Lists.—Four serial lists were used, each 
consisting of 10 nonsense syllables having a 
Glaze (1928) association value of from 0-20%. 


rhe four lists were as follows: 
List 2 List 3 


SIW GOK 
QAS GUW 
GAH QUW GYK 
MEQ KOJ KIG 

SIJ BE] KEZ 
TOV YAB WUG 
NUB YOV WEZ 
YIL XEZ ZIK 

ZOK ZIK ZYW 
WUP VUX ZOW 


List 1 
CEF 
DAX 


List 4 
XAZ 
ZAX 
XEY 
YOZ 
ZUY 
YUZ 
X1LY 
ZEY 
YIX 
ZOY 
Intralist similarity increases from List 1 
thréugh List 4 as follows: List 1 has no 
duplicated consonants; in List 2, 10 
sonants are used twice each; in List 
consonants are 


con- 
3, four 
used five times each, and in 
List 4, one consonant was used eight times, 
one seven, and five times. A vowel 
twice in list. Repeated con- 
sonants were not used systematically as first 
or as last letters, but occurred in both posi- 
tions 


one 


occurs each 


Lists 1 and 3 were used in the previous 

study (Underwood & Richardson, 1958). 
Conditions and Ss. 

by two groups of 


Each list was learned 
Ss, one having a 2-sec. 
intertrial interval and one having a 17-sec. 
intertrial interval. Thus a total of eight 
groups of Ss was used and each group con- 
tained 30 Ss, all naive to verbal learning in 
the laboratory. A randomized order of lists 
and intervals was made up subject to the 
limitation that each list occur 30 times with 
each interval. The Ss were assigned suc- 
cessively in order of their appearance at the 
laboratory. 

Ten different serial orders of the items in 
each list were used such that each item oc- 
curred once at each serial position. Three Ss 
in each group learned with a given order. 
Three horizontal asterisks were used as the 
anticipatory cue for the first syllables. Rate 
of memory-drum presentation was 2-sec. per 
item with the anticipation method being 
used throughout. A total of 40 such anticipa- 
tion trials was given. The groups having 
distributed practice worked on a cancellation 
task during 15 sec. of the 17-sec. distribution 
interval. 


RESULTS 
Total correct over 40 trials.—The 
mean total correct responses over 40 
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The base- 
line is plotted as four equally sepa- 


trials are shown in Fig. 1. 


rated points although no assumption 
is made that the lists 
represent equal increments in intra- 
list similarity. It is to be noted first 
that under massed learning perform- 
ance decreases as intralist similarity 
increases. The second fact shown 
in Fig. 1 is that distributed practice 
consistently leads to better perform- 
ance than massed practice. 
Both of these above facts are sub- 
stantiated statistically. The F for 
Lists (Intralist similarity) is 15.27; 
with 3 and 232 df, the F at the 1% 
level is 3.88. The F for intertrial 
interval is 29.86; with 1 and 232 df, 
the F at the 1% level is 6.76. There 
is no statistical evidence of hetero- 
geneity of variance. From Fig. 1 it 
also appears that the effect of dis- 
tribution is greater the greater the 
intralist similarity. If such were 
true, it would substantiate the finding 
of the earlier study with lists of high 
meaningfulness. Statistically speak- 
ing, this interaction falls just short 
of the 5% level. The F is 2.58; with 
3 and 232 df, an F of 2.65 is needed 
for significance at the 5% level. 
Therefore, on this evidence alone, 
there is no strong support for the 


successive 


does 
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Fic. 1. Total correct responses over 40 
trials as a function of intralist similarity and 
intertrial interval. 
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Fic. 2. Total correct responses on Trials 
31-40 as a function of intralist similarity and 
intertrial interval 


interaction between intertrial interval 
and intralist similarity. 

It will be noted that the difference 
in performance resulting from massing 
and distribution is greater for List 1 
than for List 2. It is not known 
whether this results from some peculi- 
arity in a list or whether a sampling 
variation is involved. Complete learn- 
ing curves show that the difference 
in massing and distribution for List 
1 is evident throughout most of the 
40 learning trials, whereas for List 2 
the curves clearly separate only after 
20 trials. As a point of 
analysis, therefore, a measure using 
only performance late in learning is 
used. 

Totai correct on last 10 trials.—As 
noted in the introduction, data from 
the previous study (Underwood & 
Richardson, 1958) suggested that had 
learning been carried further in that 
study for lists of low meaningfulness, 
a significant interaction between in- 
tralist similarity and intertrial inter- 
val would emerge. In the present 
study, 10 additional trials were given. 
On the assumption that the effects 
of massing may be accumulative, a 
test for the interaction has been made 


second 


SCHULZ 


over the last 10 trials in the present 
experiment. The mean total correct 
responses on Trials 31-40 are shown 
in Fig. 2. Again, the effects of the 
two major variables (intralist simi- 
larity and intertrial interval) are 
highly significant. Furthermore, the 
interaction between the two variables 
gives an F of 3.33, with an F of 2.65 
needed for the 5% level, and 3.88 for 
the 1% level. The interpretation 
is not completely unambiguous, how- 
ever, since the Hartley test shows the 
maximum difference in variances to 
be significant beyond the 1% level. 
Nevertheless, it does not seem un- 
reasonable to conclude that there is 
a true interaction effect. It will be 
noted that the effect of distributed 
practice with Lists 3 and 4 is large; 
the performance on these lists is 
approximately the same as that for 
List 1 (low similarity) learned under 
massed practice. Thus, distributed 
practice almost completely cancels 
the deleterious effects of increasing 
intralist similarity. 

Co-ordination with previous experi- 
ment.—In the previous experiment 
(Underwood & Richardson, 1958) 
Lists 1 and 3 were used with inter- 
trial intervals of 8 sec. and 38 sec. 
The Ss for both experiments were 
drawn from the elementary psychol- 
ogy course and all were naive to 
verbal learning. No large differences 
in ability level between the groups of 
Ss in the two experiments would be 
expected. In the previous experi- 
ment 30 anticipation trials were 
given; the data for the two experi- 
ments are brought together in terms 
of mean number correct on Trials 
26-30. In the previous study 50 Ss 
were used for each condition. To 
equate N for the two experiments for 
statistical purposes, 30 of the 50 Ss 
were selected randomly subject to 


the restriction that each of the 10 
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Fic. 3. Total correct responses on Trials 
26-30 as a function of intralist similarity 
and intertrial interval. List 1 had relatively 
low intralist similarity, List 3, relatively high 
rhe filled represent 
previous experiment. 


circles data from a 


orders of the items for a given list was 
represented by three Ss. The results 
shown in Fig. 3. Again, both 
intertrial interval and intralist simi- 
larity are highly significant. The 
interaction term, however, falls far 
short of significance even though the 
change in performance between 2 sec. 
and 17 sec. for List 3 is much greater 
than for List 1. 
intertrial interval no appreciable in- 


are 


Beyond a 17-sec. 


crease is produced by distributed 


practice. 


Although no data on the 
matter were presented in the previous report, 


Other findings 


it was observed that the bowed serial-position 
curves were different for massing and dis- 
tribution when distribution facilitated learn- 
ing. Distribution facilitated learning more 
for items in the later serial positions than 
This effect is maxi- 
mized in the present study for Lists 3 and 4 
where the effect of distribution 
greatest This may be seen in Fig. 4 where 
the mean number of responses for 
each serial position for Lists 3 and 4 com- 
bined are plotted. As can be seen, there is a 
gradual increase in the difference between the 
two curves for successive serial positions. 


in the early positions. 
was the 


correct 


In all four lists in the previous study per- 


formance under the distributed conditions 
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than 
proximately the first six 
this point a “switch-over’ 
distributed 


was poorer under the massed for ap- 
trials. At about 
occurred and the 
became superior. It 
was believed that this effect was due to one 


learning 


of two causes a) In distributed learning 
S had to learn to turn to the cancellation 
task at a signal at the end of a trial and to 
turn back to learning at a signal at the end 
of the cancellation period. Some confusion 
may have resulted from this procedure so 
that the learning was initially 
the distributed 
massed 


lower under 
than under the 
(6) Another possible cause was that 


distributed practice 


condition 


allowed for a_ slower 
warming up than did massed practice. That 
is, if warm-up is lost during the distribution 
intervals, maximum warm-up 
tained more quickly by Ss 
massed than by 
under distributed conditions 


would be at 
learning under 


conditions learning 


those 
In the present 
experiment there was absolutely no evidence 
for the switch-over effect rhis would indi- 
cate that the second of the two above alterna- 
tives is a more plausible explanation than the 
first for the switch-over effect observed in the 
previous study. In the present study the Ss 
serving under distributed-practice conditions 
would have had to learn to switch back and 
forth learning to cancellation just as 
did the Ss in the previous experiment But, 
in the previous experiment, the distribution 
interval 38 sec 


from 


was as opposed to the 17 
sec. used in the present study It is therefore 
possible that loss of warm-up would be greater 
between trials with the distribution 
interval thus producing the switch-over effect 
which did not 
distribution interval 


longer 


occur here with a_ shorter 
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Fic. 4 
trials as a 
intertrial interval The 
3 and 4 combined 


Total 


function of 


correct over 40 
serial position and 


Lists 


responses 


data are for 
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DISCUSSION 


The results clearly show that dis- 
tributed practice facilitates the learning 
of lists of low And, 
although there is some statistical am- 
taken in 
conjunction with the ones from Under- 
wood and Richardson (1958) 
that intralist similarity is the 


interacting variable. The evidence points 


meaningfulness. 


biguity, the present results 
indicate 


critical 


rather persistently to the fact that with- 
out a certain minimum degree of inter- 
ference, distributed practice will not 
facilitate learning. In the list with the 
lowest degree of intralist similarity used 
here, there still remain duplicated vowels 
this 
necessary for 


which could produce minimum 


degree of interference 
distributed practice to facilitate learning. 
It is also possible that as meaningfulness 
gets lower and lower a new source of inter- 
ference develops, namely, an interference 
letter- 
Whatever the source, 
necessary 
distribution 
minimum number of conflicting response 


from previously well-established 
sequence habits. 
that a 


for facilitation by 


it appears condition 


is some 


tendencies. 

that some form of 
function of 
conflicting response tendencies and that 


If one chose to say 
inhibition develops as a 
this inhibition dissipates with relative 
rest, the 
problems. 


data would offer no 


Such an 


present 
serious inhibition 
to dissipate very rapidly 
(a) The effects 
of distribution appear to be asymptotic 
at about 15 sec. even with very high 
intralist similarity; (0) 
less in the early serial positions than in 


would need 


for the following reasons: 
The effects are 
the later ones, suggesting that even the 


2 sec. between trials in the massed condi- 
tions allows some loss of inhibition but 


which builds up rapidly again as S is 


presented the list; (c) There is an appre- 
ciable difference in rate of learning with 
a 2-sec. intertrial interval as compared 
with an 8-sec. interval. 

It is possible to construct a build-up 
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function and a dissipating function for 


such an inhibition which will mediate 


the present results. The problem arises, 
however, in attempting to specify further 
inhibition, 
cially how it related to learning. 


characteristics of the espe- 
The 
classical performance-type inhibition does 
not seem appropriate for a number of 
But, the present authors are 
not yet prepared to offer an alternative. 


reasons. 


SUMMARY 


In an attempt to clarify some ambiguities 
found in a previous study, four serial lists of 
10 nonsense syllables of low meaningfulness, 
varying in 
with a 2-sec 


17-sec 


intralist similarity, were learned 

intertrial interval and with a 
intertrial interval. The eight groups 
of 30 Ss were naive to verbal learning. Each 
list was presented for 40 trials. 

Che results show that distributed practice 
facilitates learning at all levels of intralist 
similarity. While the statistical evidence on 
the matter is not entirely convincing, it was 
concluded that distributed practice produces 
a greater facilitation in lists of high intralist 
similarity 
larity 


intralist simi- 
rhe effect of distribution is greater 
for items in the late serial positions than for 
In comparing 
the results with those of a previous study it 
concluded that intertrial intervals be- 
yond about 15 sec. would not produce addi- 
tional facilitation in learning 

It was pointed out that distributed prac- 
tice facilitates 


than in lists of low 


those in the early positions. 


was 


learning only when some 
minimum number of conflicting responses are 
The results are compatible with 
an inhibition theory when this inhibition is 


produced by interference 


present. 
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MIXED VS. UNMIXED LISTS IN TRANSFER STUDIES 


HELEN M. TWEDT anp BENTON J. UNDERWOOD! 


Northwestern University 


In 
been 
with 


relatively recent years there has 
an increase in the frequency 
which mixed lists (ML) have 
been used to study transfer with 
verbal materials. In MLa single 
group of Ss may be used. Each S 
learns only two lists in which different 
relationships between subgroups of 
items allow the simultaneous observa- 
tion of the transfer effects produced by 
these different relationships. Thus, 
if each of two lists consisted of nine 
paired associates, three pairs in each 
list could form the A-B, A-C para- 
digm, three the A-B, C-B, and three 
an A-B, C-D paradigm. 

The ML procedure may be con- 
trasted with the classical one in which 
unmixed lists (UL) are used. With 
UL the between all 
items in the two lists are the same. 
Thus, if the paradigm was A-B, A-C, 
all stimuli in the two lists would be 
the same and all responses different. 
Obviously, to study transfer effects 
for more than one paradigm requires 
as many different sets of lists as there 
are paradigms. 

Certain contradictions in the facts 
of transfer could be attributed to the 
possibility that UL and ML produce 
different transfereffects. Theclearest 
example of this possibility is seen 
in the work of Mandler and Heine- 
mann (1956) and Porter and Duncan 
(1953). In both of these studies an 
A-B, A-C paradigm was used, and 
also a paradigm that will be called 
A-B, A-Br. In this latter paradigm 
the same stimuli and same responses 


relationships 


are used in both lists (UL) or among 


! We are grateful to R. W. Schulz for his 
critical reading of the manuscript. 
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a subgroup of items in the two lists 
(ML), but they are re-paired in the 
second list. Porter and Duncan, 
using UL, found poorer second-list 
performance on A-B, A-Br than on 
A-B, A-C, while Mandler and Heine- 
mann found just the opposite. There 
are other differences between the two 
studies over and above UL vs. ML 
which could account this dis- 
crepancy. Indeed, study (1958) 
published after the present one was 
completed essentially confirmed the 
Porter-Duncan results ML. 
Therefore, other differences between 
the studies may be responsible for 
the different results. In other con- 
tradictions in the literature (e.@., 
Bugelski and Cadwallader [1956] vs 
Osgood [1946]) ML vs. UL is again 
one several possible factors in- 
volved. Therefore, a direct test of 
the transfer effects for the two kinds 
of lists is indicated to decide whether 
or not ML vs. UL is a variable of 
importance in the study of transfer. 
The major purpose of the present 
study is to make such a test. 

There was an a priori reason to 
suggest that ML vs. UL is a variable 
influencing the relative amount 
transfer among different paradigms. 
This may be by considering 
the situation in which S learns A-B, 
A-C with UL. While this paradigm 
can produce negative transfer, it has 
always been puzzling why the 
negative effects are small, and why 


for 
a 


using 


of 


ol 


seen 


(a) 


(b) so few B responses intrude during 
the learning of A-C. This would be 
less puzzling if it is assumed that two 
relatively independent response sys- 
tems differentiated 


are maintained, 
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by temporal or first and second list 
distinctions. If this differentiation 
is possible, S could instruct himself 
not to give responses from the first 
list and a relatively small amount 
of negative transfer would be ex- 
pected. However, if the ML pro- 
cedure is used in which some of the 
first-list responses are appropriate in 
the second list (A-B, C-B or A-B, 
A-Br) and some are not (A-B, A-C), 
the maintenance of the independent 
response systems is not possible. The 
prediction would be, therefore, that 
negative transfer would be greater 
for ML than for UL in the A-B, A-C 
paradigm. It 
other interactions among effects of 
other paradigms could occur in ML 
and not in UL. 


is also possible that 


METHOD 


Lists.—The transfer effects for 
digms were studied, each by the use of both 
UL and ML. If A-B first 
list, the four paradigms may be symbolized 
in the second list (UI 
items in the second list (MI 
C-B, and C-D 


considered the control condition, in that no 


flour para 


represents the 


or in subgroups of 
is A-C, A-Br, 
The last paradigm may be 
systematic identities obtain between the 
' between the 
ML. 

The materials were paired-associate lists 
icjec tives. All 
Ss in ail conditions with both UL and ML 
learned exactly econd list. There 
fore, any differences in transfer resulting from 
ML vs. UI be attributed to differ- 
ences in list or pair difficulty rhis second 
list consisted of the 
honest 


items in the two lists (UL) or 
particular subgroup of items for 


of 12 pairs of two-syllabl 


the same 
cannot 
following 12 pairs: 
senior—rotten, certain—aloof, 
unshut, complex—wval- 
noonday, zigzag—absurd, luke- 
stubborn, profane—bitter, famous 
exact, filthy—rising These 12 
selected by use of a table of random numbers 
from a pool of 24 pairs used by Young (1955 
in his low-similarity The items 
forming the remaining 12 pairs were used 
as necessary to construct the first’ list: for 
UL, all 12 pairs were used for the A-B, C-D 
paradigm, none was used for A-B, A-Br, and 
12 items{were necessary for A-B, A-C, and 


A-B, C-B, 


frantic, 
ready—severe, human 
iant, single 
warm 


pairs were 


conditions 
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For ML, with 4 conditions and 12 pairs 
in a list, each condition was represented by 3 
pairs. The use of four groups of Ss for ML 
made it possible to use all 12 pairs in the 
second list for each paradigm. Thus, in UL, 
four groups of Ss were needed, each group 
having a given paradigm, and this paradigm 
held between all 12 pairs in the two lists 
In ML, four groups were also used, and for all 
four groups combined, the relationship be- 
tween the first and second lists was exactly 
the same as for UL 

Procedure.—The first list was presented 
until one perfect trial was attained. After 
1 min. the second list was presented for 10 
Learning the anticipation 
method, with the stimulus being presented for 
2 sec. and the stimulus and response together 
for 2 sec., as timed by a Hull-type memory 
drum. The intertrial interval was 4 se 
rhree different orders of the lists were used to 
ML the three 
pairs representing a given paradigm were not 
grouped; that is, 
was such that those representing one para 


trials was by 


minimize serial learning. In 
the ordering of the pairs 


digm were mixed up with those representing 
other paradigms. In neither ML nor UL was 
S instructed concerning the nature of the 
relationships between the two lists 

Four groups of 18 Ss each learned under the 
ML procedure and four other groups of the 
same number learned under the UL procedure 
An order of conditions was made up such that 
each of the eight conditions was represented 
once by each block of eight Ss. Within each 
block the random The Ss 
were simply assigned in order of their appear- 
All Ss had previous 
experience in verbal-learning experiments 

It should be emphasized that all Ss learned 
the same second list rherefore, it is of little 
or no consequence whether or not first lists 
differed in difficulty; this would be true for 
comparisons within UL or within ML, and it 
would be true for comparisons between UL 
ML. In actual fact, for eight groups, 
the over-all mean number of trials to learn 
the first list was 12.7 trials, with a range of 
from 10.5 to 15.2. An analysis of 
yielded an F of 1.46, which is far from signifi- 
cant that 
these first lists did not differ in difficulty since 
possible ability differences among groups and 


ordering was 


ance at the laboratory 


and 


variance 


his does not mean, of course, 


possible list differences are confounded 


RESULTS 


The results will be presented first 
in graphical form and then attention 
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will be given to statistical analyses. 
In Fig. 1, the mean total correct 
responses on the 10 transfer trials 
are shown for each condition for both 
UL and ML. For UL, each condi- 
tion represents the mean performance 
of 18 Ss on 12 pairs. For ML, each 
condition represents the performance 
of 72 Ss, each learning three pairs. 
To coordinate the values for UL and 
ML, the total correct responses were 
Summed for successive groups of four 
Ss each on ML, such that all 12 items 
representing a condition for UL were 
likewise represented for each sub- 
group of four Ss in ML. Thus, in a 
18 Ss were ‘‘constructed”’ out 
of the 72 Ss learning under ML, and 
this was done for each transfer condi- 
tion. 

In Fig. 1, it may first be noted that 
differences between UL and ML are 
very small. The second fact shown 
in Fig. 1 is that if C-D is considered 
the control condition, all three other 
conditions show negative transfer, the 
amount of this 


sense, 


negative transfer 
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ransfer trials as a function of mixed and un- 


correct responses over 10 


mixed lists and four transfer paradigms 


UNMIXED LISTS IN 


TRANSFER STUDIES 


increasing from C-B, to A-C, to 
A-Br. 

In Fig. 1, the values represent the 
total correct over all 10 
transfer trials. It is conceivable that 


a different ordering of the conditions 


responses 


could occur at various stages in learn- 
ing the second list. That this is not 
true is shown by Fig. 2, where it is 
seen that the ordering of conditions 
remains relatively constant through- 
out all 10 trials. Also, Fig. 2 shows 
quite clearly that the differences be 
tween UL and ML are of small con- 
sequence. The coordination between 
UL and ML was handled 
way for Fig. 2 as for Fig. 1 


the same 


The first statistical analysis consists of a 
direct test of the results for UL vs. MI rhe 


measure used was the number of correct 


Phese 


total correct 


responses over the 10 transfer trials 


comparisons were made for the 
for each group of three items 
in ML one 18 Ss had 
group of three pairs for A-Br 


three pairs were also learned by 


For example, 
1 particular 
These same 

18 Ss in Ul 

Four ¢ tests could be made for each condition, 

therefore, and these four tests would exhaust 

all 12 items used for that condition. With 

four conditions (A-Br, A-C, C-B, and C-D 

and four tests for each condition, a total of 

16 ¢ tests involved The 


that not a single ¢ was sig 


group ol 


was results were 


ificant, using the 
05 significance level Thus, it may be con 
cluded that for the present materials 
conditions, whether | ML is 


determi v the 


and 
used is of no 
consequence in amount of 
transter 

The statistical analy 


UL procedure 


sis of results for the 
straightforward 
Simple analysis of variance showed conditions 
to be a highly 


Several ¢ tests (using the pooled estimate for 


alone was 

significant source of variance 
the error term) show the following facts: (a 
If C-D is considered the control 
A-C and A-Br resulted in significant negative 
transfer whereas C-B did not rhe ¢ values 
were 6.29, 2.68, and 1.52, respectively, 
34 df. (b) The amount of negative transfer 
produced by A-Br is greater than that pro 
duced by A-C. The tis 3.61. A comparable 
inalysis performed on the mean correct re 
[rials 1 and 2 did not change the 
reached for the analysis of the 
ill 10 trial 


condition, 


with 


sponses on 


conclusions 


tot il correct ovet 
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Acquisition curves over 10 transfer trials as a function of mixed and 
unmixed lists and four transfer paradigms 


For the statistical analysis of the results 
for ML alone, direct-difference ¢ tests were 
used among conditions, each based on 72 Ss, 
with each S having three items learned under 
each condition. Comparing C-D with each 
of the other three conditions gave ¢ values of 
10.09, 4.59, and 2.40, for A-Br, A-C, and C-B, 
respectively. The only which 
differs from those reached for UL is that in 
ML the C-B condition resulted in significant 
negative transfer (the ¢ of 2.4 is at the .02 
significance level). 


conclusion 


For the A-C and A-Br conditions, 
a count was made of intrusions of 
first-list responses during the learn- 
ing of the second. These were counted 
only if they occurred to the stimulus 
with which they had been paired in 
the first list. For A-Br, the total 
intrusions over 10 trials was 182 for 
ML and 154 for UL. For A-C, the 
values were 5 for ML and 4 for UL. 
Thus, the nature of the list does not 


appear to seriously affect intrusion- 
making tendencies. 


DISCUSSION 


The results have shown that UL and 
ML produce essentially the same transfer 
results. Indeed, it is not likely that 
correspondence would have been higher 
had the two experiments used the same 
method (either UL or ML). 
plication of these results is that contra 
dictions in the facts of transfer, noted in 
the introduction, are probably not due 
to ML vs. UL. A second implication is 
that the possibility that S maintains 
independent response systems (also sug- 
gested in the introduction) in the A-B, 
A-C paradigm is untenable. The third 
implication is that it appears that 
investigators may choose either ML or 
UL procedures, depending 

cumstances which make one 
than another for a 
investigation, without concern 
that the transfer results will be affected 
by this choice. 


One im 


upon cir- 
more de 
sirable particular 


seric us 


It is possible, of course, 
that the present near-identity of results 
for UL and ML will not hold with all 
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materials, e.g., consonant syllables, al- 
though there is no obvious reason why 
they should not. 

Concerning the transfer effects, the 
present experiments confirm Porter and 
Duncan (1953) and Besch and Reynolds 
(1958) in showing that A-Br produces 
more negative transfer than A-C. That 
Mandler and Heinemann (1956) actually 
found positive transfer with A-Br and 
negative transfer with A-C may be 
attributed (as suggested by Besch and 
Reynolds) to the low meaningfulness of 
the responses used by Mandler and 
Heinemann. However, this matter will 
not be pursued here. It is sufficient to 
say that it does not appear that the 
source of contradiction is in the nature 
of the lists (ML vs. UL). 

The one finding that does not fit 
current theory or previous findings is 
that the A-B, C-B paradigm produced 
negative transfer when evaluated against 
A-B, C-D. It is true that the amount 
of negative transfer is small, but it is 
consistent in both experiments and is 
present throughout most of the 10 trans- 
fer trials. Furthermore, there is reason 
to believe that the negative factor (or 
factors) may be greater in extent than 
actually appears in the performance. 
It is known (Underwood, Runquist, & 
Schulz, 1959) from other research that 
teaching S responses before learning a 
list of paired associates will facilitate the 
learning. In effect, when S learns A-B 
he has learned the responses required in 
learning C-B; therefore, the learning of 
C-B should be facilitated. In the pres- 
ent results, this positive factor is more 
than counteracted by negative effects. 
The question remains, therefore, as to 
the source of this interference. The best 
guess is that it results indirectly from 
backward learning. It is known (Feld- 
man & 1957) that as S 
learns A-B, an association develops be- 
tween B and A so that B will elicit A 
with much greater than chance proba- 
bility. It may be assumed that 


Underw ood, 


such 


backward associations developed in the 
learning of the first list in the present 
study; 
elicit A. 


B will have some tendency to 
The S then proceeds to learn 
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C-B. As this association develops, it is 
possible that when C is presented it 
makes S “think of" B, but that B (via 
backward association) makes him think 
of A (the stimulus in the first list) leading 
to an erroneous ‘‘conclusion’’ that B 
does not go with the present stimulus—C. 
It would not require many such con- 
fusions to produce the negative transfer 
observed. 


SUMMARY 


This study was designed to investigate 
the transfer effects in verbal paired-associate 
learning using a mixed design (in which 
subgroups of pairs in the second list form 
various transfer paradigms with pairs in the 
first) and an unmixed design (in which all 
pairs in the second list relate in the same way 
to the pairs in the first). Four paradigms 
were studied by each of these two procedures : 
A-B, A-Br (second-list pairs constitute a re- 
pairing of first-list pairs); A-B, A-C; A-B, 
C-B; and A-B, C-D. The latter paradigm 
constituted the control condition. For all 
paradigms for both mixed and unmixed 
designs all Ss learned the same second list 
The various paradigms were formed by 
changing the pairs in the first list. Eight 
groups of 18 Ss each were used. The first 
list of paired adjectives was presented until 
all items were anticipated on a 
single trial; the second or transfer list was 
presented for 10 trials 

The results show no difference in the trans 
fer effects as a function of mixed vs 
lists. Thus, the use of various paradigms 
among subgroups of items within a single 
list does not produce interactions among the 
transfer effects for the various paradigms 
It was concluded, therefore, that investigators 
may use either type of design without serious 
concern that the will be 
different. 

Using A-B, C-D asa control, the results 
show 


correctly 


unmixed 


transter efiects 


that all paradigms produce negative 
transfer throughout the 10 trials on the second 
list. The greatest negative transfer was 
produced by A-B, A-Br, the next greatest by 
A-B, A-C, and the least by A-B, C-B. A 
possible source of interference in the latter 
paradigm was discussed 
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INDIVIDUAL PATTERNS OF PHYSIOLOGICAL ACTIVITY 
AS A FUNCTION OF TASK DIFFERENCES AND 
DEGREE OF AROUSAL ! 
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A series of studies from this labora- 
tory, using a variety of experimental 
procedures, have been concerned with 
the relationship of physiological meas- 
ures to level of motivation (Barto- 
shuk, 1955; Malmo & Davis, 1956; 
Stennett, 1957b; Surwillo, 1956b). 
As indicated by Malmo, in a recent 
review (1957), the findings of these 
studies can be interpreted as being 
consistent with the formulation of an 
independent “arousal” (or intensity) 
dimension of behavior (Duffy, 1951; 
Hebb, 1955; Lindsley, 1951; Schlos- 
berg, 1954). Furthermore, the data 
are encouraging with respect to the 
proposition that physiological meas- 
ures may serve to quantify this 
dimension of arousal. 
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The main purpose of the present 
experiment was to attack this general 
problem from the point of view of 
individual differences, a topic that 
has recently been reviewed by Duffy 
(1957). In part, the aims of the 
present investigation were similar to 
those of Lacey and his co-workers in 
their studies of “‘autonomic response 
specificity’ (Lacey, Bateman, & Van- 
Lehn, 1953; Lacey & Lacey, 1958) 
and those of workers in our own labora- 
tory, in attacking the problem of 
“symptom specificity’’ (Malmo, Sha- 
gass, & Davis, 1950). The speci- 
ficity principles hold that individuals 
differ with respect to which physio- 
logical measures show the greatest 
change under standard conditions of 
stimulation. 

In extending earlier work in this 
area of individual differences, the 
present study differs from previous 
experiments mainly in three respects: 
(a) additional tasks have been em- 
ployed, (b) quantitative variation 
in arousal (intensity) level was in- 
cluded as a variable in addition to 
the qualitative differences from task 
to task, and (c) muscle action poten- 
tials were added to the autonomic 
measures. Previous work on speci- 
ficity with nonpatients had not in- 
cluded measures of the 
striate musculature. 

In addition to the new attack on 
the problem of physiological speci- 
ficity itself, there are other purposes 
of the present investigation that are 
chiefly bound up with our interest in 
the concept of arousal. In the first 
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place, there is a rather important 
methodological problem to be con- 
sidered. Some writers have expressed 
strong doubts about the possibility 
of finding any systematic relation- 
ships between a hypothesized unitary 
process such as arousal and various 
physiological measures, because of the 
lack of consistency in physiological 
reactions from individual to individual 
and from one situation to the next. 
However, the chief hypothesis of the 
present study is that although indi- 
viduals will indeed exhibit idiosyn- 
cratic patterns of physiological acti- 
vation, these patterns will be consist- 
ent for the individuals in qualitatively 
different situations, and in those 
designed to be differentially arousing. 

In addition to the matter of indi- 
vidual patterns of physiological acti- 
vation, the further question of indi- 
vidual differences in what may be 
called ‘‘general arousability’’ was also 
investigated. That is, the hypothe- 
sis was advanced that whatever their 
specific patterns might be, some 
individuals would characteristically 
show greater activation than others 
in all of the experimental conditions. 

Finally, experimental design in- 
cluded a plan to compare the physio- 
logical measures with respect to their 
apparent power to differentiate mean- 
ingfully among the different experi- 
mental conditions. This is the ques- 
tion of whether, despite individual 
differences, some physiological meas- 
ures appear superior to others in 
gauging level of arousal. 


METHOD 
Experimental Conditions 


Individual differences in physiological 
activity were studied by varying two factors: 
(a) the qualitative nature of the task, and 
(b) the noxiousness of the experimental situa- 
tion. An attempt was thus made to induce 
relatively low and relatively high arousal 
levels while Ss were doing visual pursuit 
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tracking and while they were doing arithme- 
tic. The conditions were as follows: “low” 
arousal tracking (LT), ‘“‘low’’ arousal arith- 
metic (LA), “high” arousal arithmetic (HA), 
and “high” arousal tracking (HT). Every 
S served in all four conditions. 

Prior to the main experimental trials, all 
Ss had four practice tracking trials. They 
were instructed to do as well as they could 
and were informed of their scores after every 
trial. 

In order to cope with the problem of 
administering a task to Ss and at the same 
time achieving a low level of arousal, the 
often used method of a ruse was adopted. 
The LT trials were presented to S as if they 
were not actually a part of the experiment. 
After the practice trials were completed, S 
was informed that during the next few trials 
he would be required to do some arithmetic. 
Then, before starting the arithmetic, E re- 
quested S to assist him with some adjust- 
ments and calibration of the equipment, 
which purportedly required S’s participation. 

To increase the level of arousal for the 
HT trials, S was required to track under 
conditions of intense auditory distraction and 
was told that he would receive a strong shock 
if he failed to improve his best score obtained 
in any of the previous trials. To impress on 
S that this was no hollow threat, a shock was 
administered 60 sec. before the beginning 
of the first HT trial. The S was informed 
that if he failed to improve his best score, 
he would receive a shock twice as strong as 
the “sample.” However, no shock was 
actually administered after a trial. After 
the first HT trial Ss were informed of a score 
that was about 10% worse than their best 
score but that they would be given another try. 

During the LA trials, S was required to 
perform multiplications of one digit numbers, 
e.g., 3 X 8 =? This was, of course, a task 
involving rote memory. 

During the HA trials S was required to 
perform more difficult multiplications, e.g., 
17 xX 18 = ? In addition to this, he was 
heckled if he provided a wrong answer cr 
failed to respond within 5 sec., e.g., ““This 
shouldn't take that long!’ “Hurry up now!” 
During the arithmetic trials Ss were required 
to hold the tracking pointer at its maximum 
deflection, in order to equate the muscular 
demands with those for the tracking. 


Subjects 
The Ss were 43 male university students. 


They were all righthanded, and ranged in 
age from 16 to 23 yr., with a median age of 19. 
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Apparatus 


Tracking system.—The apparatus used for 
visual pursuit tracking was a modified ver- 
sion of one previously reported by Surwillo 
(1956). The modification consisted of: (a 
major changes in circuitry,’ (b) the two micro- 
ammeters were mounted one above the other, 
instead of side by side, which slightly in- 
creased the difficulty of the task, and (c 
ball bearings were incorporated for the 
controller knob shaft to reduce friction. 

The display unit of the tracking system 
consisted of two pointers (indicators of the 
two microammeters). The axes of these 
apart. The position 
of the top pointer was varied by a function 
generator at a constant rate from left to 
right and back, over a total arc of 60°. A 
complete cycle of this function, from 0° to 60 
and back to 0°, lasted 1 min. The S's task 
was to control the bottom pointer, so that its 
reading was always equal to that of the top 
pointer. The torque required for a maximum 
deflection of the pointer was 5,000 gm.-cm. 
and the range of knob rotation was 8°. 

The controller knob contained a mechan- 
ism which permitted continuous recording of 
S's grip pressure. Surwillo (1956) has pro- 
vided a detailed description of its construction. 

Shock administration—A DC shock of 
100 v. was administered by a single charged- 
condenser stimulator. A charge in a two- 
microfarad condenser was discharged through 
electrodes, which were placed on the calf of 
S's left leg. The resistance of S through the 
electrodes was not more than 10,000 ohms. 

Auditory distraction 


pointers were 10 cm 


An intense auditory 
distraction was presented to Ss as follows 
By means of an audio oscillator, tones con- 
tinuously varying in frequency between 2,000 
and 3,500 cycles were recorded on a Gray 
\udograph disc. An amplified playback of 
this recording was fed through a speaker, 
which was directly 
behind S. 


located on the wall 


Physiological Variables 


Muscle activity.—Electromyograms (EMG 
from the ventral surfaces of right and left 
forearms (pronator teres), and from the right 
side of the back of the neck (semispinalis cap- 
itis and s pli nius capilt were recorded by an 

' The electronic circuitry of this system, 
which employed AC amplifiers instead of DC, 
designed by J]. F. Davis, is on file in the 
Laboratory for Psychological Studies, Allan 
Memorial Institute of Psychiatry, Montreal. 
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Offner Type D Electroencephalograph. De- 
tails of the placement and recording tech- 
nique used in this laboratory have been 
reported by Davis (1952). The electro- 
encephalograph was also used to record varia- 
tions in grip pressure. The muscle potentials 
and the grip pressure variations were inte- 
grated once every 4 sec., and recorded as 
spikes on a separate chart drive. 

All EMG measurements 
terms of microvolts from the integrated 
records. From preperiods three spikes were 
measured: 32-28, 20-16, and 8-4 sec. before 
the beginning of a trial. Ten spikes were 
measured for every trial: first, 4-8 sec. after 
the beginning of a trial and every third spike 
thereafter. For grip pressure the same 
sampling method was used, except that 
measurements were taken for trial periods 
only. Pressure was measured in grams. 

Heart rate-—A continuous recording of 
heart rate was obtained by an electrocardio- 
tachograph designed by Davis.‘ A _ chest 
placement was chosen to insure maximum 
freedom from EMG artifact. This and the 
subsequent variables were recorded on a third 
chart drive 


were made in 


From preperiods three measurements were 
made, at 30, 15, and 4 sec. 
ning of a trial. Eight measurements 
made during trials, one every 15 sec 
was expressed as beats per minute 

Blood pressure.—Systolic blood pressure 
was recorded using a modification® of an 
apparatus developed by Burns, Elliott, and 
James (1955 This method of 
required placing only a small 
middle finger of S's left hand 
device provided a 
every 4 se 

Three measurements 
preperiods, at 3O, 15, and 4 sex 
Nine measurements were made 
the first, 4 sec., the se 


before the begin- 
were 
Rate 


recording 
cuff on the 
The recording 
measure approximately 
made during 
before trials. 
during trials; 
ond, 15 sec. after 
‘Circuitry and description of this appara 
tus are on file in the Laboratory for Psycho 
logical Studies, Allan Memorial Institute of 
Psychiatry, Montreal 
5 The 
industrial air-pump was substituted for the 
aquarium type pump, (b) an eclipse Type DO 
valve was substituted for the 
solenoid valve, (¢ 


modifications consisted of: (a) an 


handmade 
in place of a mercury 
column and platinum wire, a Statham Type 
P23AA pressure transducer was 
Edin Type 5110 Strain Gage 
Calibrator, recycling 
feature was provided by the addition of a 
Mercoid Switch Type DA 31 


used with an 
Amplifier and 


and (d) an automatic 
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beginning of a trial, the others—-one every 
15 sec. Pressure was expressed in milli- 
meters of mercury 

Respiration.—Respiration was recorded 
using a standard type tambour and pneumo- 
graph made by Phipps and Bird. For the 
tracking trials rate was determined simply by 
counting the number of complete inspiration- 
expiration cycles. For preperiods, 30 sec. 
before the beginning of a trial were used. 

Because during arithmetic trials Ss gave 
the answers orally, rate was determined by 
sampling those periods when Ss did not talk. 
Five samples, each consisting of three com- 
plete inspiration-expiration cycles, were chosen: 
one right after the beginning of a trial, and 
the others—one approximately every 30 sec. 
This method was also used for some tracking 
trials where regular cyclical activity for the 
full trial period was lacking. Rate was ex- 
pressed in terms of the number of inspiration- 
expiration cycles per minute. 

Palmar conductance.—The circuit used for 
recording DC skin resistance in this labora- 
tory has been previously reported by Sten- 
nett (1957a). This technique involves a 
standard voltage comparator bridge with a 
constant (+2%) over the range of 
Resist- 
ance was measured between the palm and 
dorsum of the left hand. The S was grounded 
only through the lead from the dorsum. 
Resistance values were converted into mi- 
cromhos and all computations were done on 


current 
resistances observed in this study. 


these conductance values 

One measure was taken during preperiods, 
4 sec. before a trial. Five measurements were 
made during trials, the first, 4 sec., the second, 
30 sec. after beginning of a trial, the others 
one every 30 sec 

Skin temperature.—Variations in the tem- 
perature of the skin were recorded using a 
Yellow Springs Instrument Co. Model 45A 
Tele-Thermometer, a DC amplifier, and a 
recording galvanometer. The placement of 
the thermocouple was approximately 4 in. 
above the wrist on the ventral surface of the 
left forearm 

One measurement was made during pre- 
periods, 15 sec. after the beginning of a trial. 
rhree measurements made during 
trials, one at the beginning, one in the middle, 
and one at the end of a trial. Temperature 
was recorded in degrees centigrade. 


were 


Testing Procedure 


Upon arrival at the laboratory each S had 
a 30 min. interview with EZ. Biographical 


data were taken, the tracking task was 
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explained, and reassurance given concerning 
physiological recording. At the end of the 
interview the short form of the Manifest 
Anxiety Scale(Taylor, 1953) wasadministered. ® 

After the interview S was taken to the 
testing room and seated in a special chair 
with adjustable arm rests, which helped to 
reduce postural variations from S to S. 
Electrodes and attachments were applied, 
after which routine checks were made on the 
operation of the recording instruments. 

Each trial lasted 2 min., with a 3-min. rest 
period between trials. Instructions to the 
S were presented through a two-way inter- 
com. A warning to assume the tracking 
position was given 45 sec. before each trial 
(preperiod). Altogether there were 12 trials: 
4 practice tracking trials, and 2 trials each 
of the four experimental conditions. The 
entire experimental routine lasted approxi- 
mately 2 hr. The tasks were always pre- 
sented in the following order: practice track- 
ing, “low” arousal tracking, “low’’ arousal 
arithmetic, “high’’ arousal arithmetic, and 
“high” arousal tracking. A balanced order 
of conditions was not used because that would 
have introduced complications for any inter- 
individual comparison of the Ss. The length 
of the experimental session would then vary 
from S to S, because after a high arousal 
condition different Ss would require variable 
time intervals to recover from the effects 
of such conditions. 


Treatment of Data 


Means were computed for the preperiod 
and trial values from the two trials of each 
condition for each S. To make possible a 
direct comparison among the different vari- 
ables, which had been measured in different 
units, the means were converted into JT 
scores (Garrett, 1953). For statistical analy- 
sis two indicators of S's physiological activity 
were computed. One represented the level 
of activity of a particular function and con- 
sisted of S’s T score obtained from his trial 
values. The other indicator represented a 
displacement of a particular function, which 
was computed by subtracting a preperiod T 
score from the corresponding trial T score. 
However, since it was found that such abso- 
lute change values depended upon the pre- 
period values, a procedure which partialed 
out the correlation between the preperiod and 
trial scores was adopted (Lacey, 1956) 


6 This aspect of the study is not reported 
here. 
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TABLE 1 


COMPARISON OF MEAN PHYSIOLOGICAL Activiry LEVELS AMONG 
EXPERIMENTAL CONDITIONS 


Variable 


Heart rate 
Blood pressure 
Palmar cond 
Resp. rate 
Skin temp 
Right forearm 
Left forearm 
Nex k 


Grip pressure 


wuereuuns 
i) 


no 


* One-tailed test 


A level score of a particular function can 
be said to reflect the combined influence of all 
antecedent events (including instructions and 
anticipation) and the actual performance of a 
task, while a change score reflects the dis- 
placement in a particular function associated 
only with the performance during a task. 


RESULTS 


Physiological effects of experimental 
conditions.—Before presenting the re- 
sults related to the consistency of 
individual patterns of physiological 
activity, the over-all physiological 
effects of the experimental conditions 
will first be examined. Table 1 con- 
tains mean values for the nine meas- 
ures of level of physiological activity 


Condition 


during the four conditions. Freed- 
Siegel, 1956) 
analysis of variance for related sam- 
ples was used to test the reliability 


man’s (see two-way 


of the differences among the condi- 
tions. For all variables the differences 
are statistically significant. 

Table 2 presents the mean changes 
in the different functions associated 
with each data 
parallel those presented in Table 1 
Again, it can be seen that larger and 
statistically reliable displacements are 
associated with the “higher” than 
with the “lower” arousal conditions 

Patterning. 
individual’s pattern of physiological 


condition. These 


The consistency of each 


TABLE 2 


COMPARISON OF MEAN DISPLACEMENT IN PHYSIOLOGICAI 


Activiry ASSOCIATED 


With DirFERENT EXPERIMENTAL CONDITIONS 


oa 


Heart rate 
Blood pressurt 
Palmar cond 
Resp. rate 
Skin temp 
Right forearm 
Left forearm 


Neck 


tN 
or CONN OO 
nor onmnouw 


* One-tailed test 
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Fic. 1. Cases exemplifying consistency of 
patterns for physiological activity levels. 


activation from condition to condition 
was evaluated by computing Ken- 
dall’s (see Siegel, 1956) coefficients 
of concordance (W). In computing 
Ws, an individual's pattern was ex- 


pressed as the rank order of his T 


the different variables, 
ranked separately for each condition. 
For all variables, except skin tempera- 
ture, high 7 scores signify high abso- 
lute values and large increases. For 
skin temperature, high TJ scores 
were assigned to low absolute values 
and large decreases because it has 
been found that anxiety is associated 
with decreases in finger temperature 
(Mittelman & Wolff, 1939). 

For all Ss, except one, the Ws 
are statistically significant (P < .05). 
The Ws range from .995 to .336 with 
a median W of .801. Two representa- 
tive cases are presented graphically 
in Fig. 1. Note the dissimilarity 
between the patterns exhibited by 
these two Ss, while pattern 
shows considerable stability from con- 
dition to condition. 


scores for 


each 
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The Ws, derived from 7 
based on the change in activity levels 
associated with the experimental con- 
ditions, are statistically significant 
(P < .05) for 22 Ss, or 51%. In this 
case the Ws range from .802 to .227 
with a median W of .513. The re- 
sults based on change in physiological 
activity are not as decisive as those 
concerning the level of activity. 
Fewer Ss exhibit consistency in their 
patterns and in general the coefficients 
of concordance are lower. ‘To investi- 
gate the possibility that the low Ws 
are mainly due to the greater changes 
in physiological activity associated 
with the “high’’ arousal conditions, 
separate rank order correlations were 
computed between the two “low” 
and the two “high” arousal condi- 
tions. This was done only for Ss 
who did not exhibit significant Ws. 
Of the 42 such only 
one’ was significant at the .05 level. 
It is thus evident that these Ss did not 
exhibit any more consistent physio- 
logical patterns during the ‘‘low”’ than 
during the “high’”’ arousal conditions. 

Intra-individual variability of phys- 
iological functions.—The pre- 
sented in Fig. 1 also illustrate the fact 
that there is a considerable amount 
of intra-individual variability in the 
different physiological measures. 


scores 


correlations 


cases 


For 
example, S No. 2 is below average in 
skin conductance, about 
heart rate, but consistently above 
average in neck muscle action poten- 
tials. On neck muscle 
action potentials this S shows high 
physiological activation, but in terms 
of skin conductance his 
activation is low. 


average in 


the basis of 


level of 


If a group of Ss is exposed to the 
same experimental conditions, one 
might expect that all Ss would not be 
equally aroused, i.e., some Ss would 
higher over-all physiological 
activity than others. In terms of 


show 
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TABLE 3 


P VALUES FOR COMPARISONS OF MEAN PHYSIOLOGICAL Activity LEVELS 
BETWEEN EXPERIMENTAL CONDITIONS 


Variable 

LT vs. HT@ 
Heart rate 001 
Blood pressure 001 
Palmar cond : 01 
Resp. rate 001 
Skin temp ’ 01 
Right forearm .001 
Left forearm 05 
Neck N.S. 
Grip pressure N.S 


* One-tailed test 


individual patterns of physiological 
activity this means that Ss might 
differ with respect to the general 
height of their physiological patterns. 
That is, even though an S is high on 
one measure and low on another, it is 
possible that he is high (or low) on 
most measures, rather than having 
approximately as many values above 
the group mean as below. To test 
this, we would have to see whether 
the various physiological functions of 
each S were distributed around the 
group mean, or whether they were 
instead distributed around a personal 
mean, which could be either higher 
or lower than that for the group. A 


statistical test was done by computing 
an over-all x’. 


The null hypothesis 
in this case was that most Ss have 
as many values above 50 as below, and 
therefore, that one cannot differenti- 
ate among individuals in 
their over-all 
levels. Using 


terms of 
physiological activity 
level the ob- 
tained x? was 5.90, which is significant 
beyond the .001 level. The analo- 
gous x? for change scores is 1.60, which 
is significant at the .05 level. Hence, 
the null hypothesis must be rejected. 

Effectiveness of various measures in 
differentiating among conditions.—This 


scores, 


LA vs. HA* 


Comparison 


LT vs. HA* | LA vs. HT¢ | LT vs. LA HT vs. HA 


N.S 


was evaluated by computing a coef- 
ficient of concordance for each 5S. 
Previously, for the pattern analysis, 
the Ws were computed by comparing 
the rank-orders of physiological meas- 
ures for the four experimental condi- 
tions. In this the Ws were 
computed by ranking the four condi- 
tions for each physiological measure. 
With respect to the level of physio- 
logical activity the Ws are significant’ 
(P < .05) for 24 Ss (56%). The Ws 
range from .041 to .733 with a median 
W of .317. For the change in physio- 
logical activity the Ws are significant 
only for 7 Ss (16%). The Ws range 
from .005 to .581 with a median W of 
.182. However, even the latter dis- 
tribution of Ws differs significantly 
(x? = 6.99, P < .01) from the ex- 
pected distribution if the population 
coefficient were zero (Lacey & Lacey, 
1958). It 


case, 


can be seen, therefore, 


that the various physiological meas 
ures correlated to 


are a significant 


degree. Some Ss, however, show the 

7 There is no reason to expect physiological 
differentiation between the “low” 
between the two “high” arousal conditions, 
therefore, the size of these Ws is generally 
smaller than if different arousal levels had 
been induced in each of the four conditions 


two and 
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predicted changes only in a few func- 
tions while others show them in all 
or most of the functions used in this 
study. 

Finally, the various physiological 
functions were examined separately 
to compare their efficiency in differen- 
tiating among conditions. Table 3 
contains the separate probabilities for 
mean differences of all measures ob- 
tained by making comparisons among 
the experimental conditions taken 
twoatatime. ‘The first four columns 
represent the four possible compari- 
the two “low” and the 
conditions. In 


sons among 
two “high” arousal 
the last two columns are presented 
comparisons within the “low” and 
“high”’ conditions. It can be seen 
that only four measures consistently 
differentiate between the two ‘‘low”’ 
and the two “high” arousal conditions. 
These measures are heart rate (72% 
of Ss showed faster heart rate during 
both “high’’ arousal conditions as 
compared with the highest of the 
“low” arousal conditions), systolic 
blood pressure (72%), respiration rate 
(64%), and right forearm muscle 
tension (51%). The relatively small 
percentage values were obtained be- 
cause Ss often failed to differ in the 
predicted direction on only one out 
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of the four possible “low’’—‘‘high” 
comparisons. When the above four 
measures were combined (16 ‘“‘low”’ 
“high’’ comparisons for each S) one 
could differentiate reliably (P < .05) 
between the “low” and “‘high”’ arousal 
conditions for 86% of the Ss. The 
other measures fail to differentiate 
significantly at least between one of 
the “low” and one of the “high” 
arousal conditions. 

Only three measures differentiated 
significantly between the two “low”’ 
arousal conditions. These were sys- 
tolic blood pressure, skin conductance, 
and grip pressure; the values being 
significantly higher during the arith- 
metic as compared with the tracking 
trials. The two “high” arousal con- 
ditions also showed significant differ- 
ences on three measures. However, 
although left forearm and neck muscle 
tension were significantly greater dur- 
ing the tracking trials, Ss showed 
significantly higher skin conductance 
during arithmetic trials. 

The probabilities yielded by com- 
paring any two conditions for the 
amount of change are presented in 
Table 4. The large number of small 
probabilities, especially for the meas- 
ures which showed high differentia- 
tion in terms of level scores, indicate 


rABLE 4 


P VALUES FOR COMPARISONS OF 


MEAN CHANGES IN PHYSIOLOGICAL ACTIVITY 


BETWEEN EXPERIMENTAL CONDITIONS 


Heart rate 
Blood pressure 
Palmar cond 
Resp. rate 
Skin temp 
Right forearm 
Left forearm 
Neck 


# One-tailed test 


Comparison 


HAs | LA vs. HT? | L 


001 001 
001 N.S. 
001 N.S 
001 001 
N.S N.S. 
001 .001 
N.S. 001 
N.S N.S 
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that the probabilities in Table 3 are a 
result of the different arousal func- 
the two types of condi- 
tions rather than only to the differ- 
ences in the “alerting’’ response itself 
(instructions, anticipation). 


tions of 


DISCUSSION 


The hypothesis that during qualita- 
tively different stimulus 
exhibit 
stereotyped 


situations Ss 
idiosyncratic but highly 
patterns of somatic and 
autonomic activation is clearly supported 
by the data. The same S might show 
below-average levels of activity in some 


will 


functions, average in others, and above- 
average in still others. 
what the 


But no matter 
shape of the idiosyncratic 
pattern, Ss tend to respond with their 
specific hierarchy of activation to situa 
tions which require different overt re 
sponses. Even variations in ex 


perimental situations produce increases 


when 


in the over-all level of activation, signifi- 
cantly stereotyped physiological res 
ponses are observed 

consistent with 
those previously reported by Lacey et al. 
(1953) and Lacey and Lacey (1958) who 
formulated the principle of 
specificity’’ using autonomic 
In view of the 


These findings are 


“response 
variables. 
present results such a 
principle would seem to apply not only 
to the functioning of the autonomic 
nervous system but to the _ skeletal- 
muscle system as well. 

The fact that during the high arousal 
trials a general increase in physiological 
activity was observed would seem to 
support the notion of a general arousal 
state. However, any given level of 
arousal is reflected differentially in dif 
ferent individuals. Such differentiation 
is contrary to the traditional 
classifications, such as 


matic, 


broad 
autonomic-so- 
sympathetic-parasympathetic, 
since differentiation did not follow such 
classifications but was evident even with 
in cardiovascular somatic 


and systems 


themselves. This lack of homogeneity 
indicates a need for a new approach in 


our thinking about the functioning of 


ACTIVITY 


the nervous system. Recently, Wenger 
(1957), in discussing some new 
representing the activity of the 
nomic nervous 


data 
auto- 
system, drew a similar 
conclusion. 

In general, the results of this study are 
in accord with the notion of an arousal 
continuum and support the suggestion 
that peripheral physiological 
can be used to measure 


variables 
changes in 
arousal. Higher levels and greater in 
physiological activity were 
observed during the “high’’ than dur- 
ing the “low” One 
might, however, pose the question whether 
varying degrees of arousal were actually 
induced in this experiment. 


crease in 


arousal conditions. 


To avoid 
circularity, one obviously can not use the 
physiological data themselves in support 
of an affirmative answer. A recourse to 
other observables is necessary. 

The arousal theory implies that degree 
of arousal is a function of task difficulty 
and intensity of 
symbolic) (Duffy, 
things 


stimuli 
1951, p 
being 


(physical or 

32). That 
equal, during 
difficult tasks arousal is expected to be 
higher than tasks, and 
noxious situations are expected to pro 


is, other 


during easy 
duce higher levels of arousal than rela 
tively innocuous situations. It appears 
that the experimental conditions satisfy 
Conse 
data 
as supporting the prediction that in 


creases in 


these requirements adequately 


quently, one can interpret the 


arousal will be associated 

with increased physiological activity. 
Before continuing with this question 

of applying 


gauge 


physiological 


level of 


measures to 


arousal, an important 


methodological distinction deserves con 
sideration: it is 


essential to be clear 
whether the comparisons of arousal level 
are inter- or intra-individual in character 

In comparison with others one S may 
show | and 
The state 


ment that an S’s heart rate is higher than 


relatively high neart rate 


relatively low muscle tension 
of course, an 


the heart rate of others, is, 


interindividual comparison. Similarly, 
the statement concerning his relatively 
low muscle tension is also based on com 
parison with other Ss 


to othe 


ma\ 


be high relative rs in one physio 
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logical function and low (again relative 
to others) in another physiological func- 
tion is what we should expect from 
findings of idiosyncratic patterns of 
physiological activity. Such individual 
differences would also account for the 
low intercorrelations among physiological 
measures that have been reported by Ax 
(1953) and others (Lacey & Lacey, 1958; 
Malmo & Davis, 1956; Malmo & Smith, 
1955). 

But there is a point that, curiously 
enough, appears to have been overlooked 
by some writers. Despite his idiosyn- 
cratic pattern, an individual placed in an 
arousing situation will, according to the 
results of the present study, probably 
show a general increase in most physio- 
logical functions. The hypothetical case 
cited above would be expected to show 
increase in heart rate and also in level 
of muscle tension, even though, in com- 
with muscle tension 
remains low, and heart rate is still high. 
It is possible, therefore, to obtain a zero 
correlation 


parison others, 


variables 
when the correlation is computed among 
Ss, while the between the 
same two variables can be perfect if 
computed within an S. Present data 
indicate that, although the correlations 
are generally low among Ss, there is 
considerably closer correspondence among 
different physiological measures within 
the same S.* For self- 
control design (in same Ss 


between any two 


correlation 


these reasons, 
which the 
are employed in all conditions) appears 
superior to design employing independent 
groups. 

Another point im- 
pressed on us by present findings is that 
whether comparisons are intra- or inter- 


method logical 


individual in character, there are signi- 


ficant tactical advantages in employing 
several 
than in one or two. 
The large interindividual differences in 
activation, seen in all physiological func- 
tions, indicate that an individual's level 
of arousal, in a given situation, can not 


physiological measures rather 


relying on only 


8 Such correspondence among physiological 
measures is more pronounced with respect 
to the direction of change, rather than the 
amount of change. 
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be reliably inferred from a single physio- 
logical measure (somatic or autonomic). 
One would obtain a different ordering of 
Ss with respect to degree of arousal, 
depending on which physiological meas- 
ure was chosen as the criterion. Al 
though any one physiological measure 
appears to be an unreliable criterion 
of the general arousal level, by using a 
number of measures in combination, it 
seems highly probable that one can 
differentiate reliably among individuals. 
Some individuals show characteristically 
greater physiological activation than 
others in response to standard situations. 
In addition, even for intra-individual 
comparison the use of several physio- 
logical measures seems to be indicated, 
since individuals vary considerably in the 
amount of correspondence that exists 
among physiological measures and the 
significant concordance coefficients sel- 
dom approach 1.00. 

Recently, several studies have been 
reported where the relationship between 
physiological activity on one hand, and 
MAS scores and performance data on the 
other hand, has been investigated (Beam, 
1955; Berry & Martin, 1957; Calvin, 
McGuigan, Tyrrell, & Soyars, 1956; 
Raphelson, 1957). The results of this 
study suggest that the failure to find 
significant correlation between physio- 
logical activity and MAS, and perform- 
ance might be expressly due to the unde- 
pendability of a 
measure as an 


single physiological 
indicator of 
phy siological activity. 

In addition to the individual differ- 
ences discussed above, it was found in the 
present study that, for purposes of 
distinguishing among arousal levels, some 
physiological functions seem to be more 
adequate than others. Such differences 
among the physiological 


general 


measures ex- 
emplify the differential sensitivity of the 
various functions. It appears that a 
given change in the general arousal state 
might be reflected consistently in some 
peripheral bodily functions and not in 
While heart rate, systolic blood 

respiration and right 
differentiated 
consistently among conditions, skin con- 


others. 
pressure, 
forearm 


rate, 


muscle tension 
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ductance, skin temperature, left forearm 


muscle tension, and grip pressure failed 
to differentiate at least 
the “low” and one “‘high”’ arousal condi- 
tion. By combining the first four meas- 
ures more reliable differentiation between 


between one of 


the arousal levels was possible than on 
the basis of any one of these measures 
alone. 

In terms of the total arousal 
tinuum, arousal levels ‘exhibited by Ss 
of this study would to represent 
moderate degrees of arousal. Such an 
inference is based on two considerations. 
arousal 
conditions Ss were required to perform 
a task. Second, the “high” 
conditions were not of great enough 
intensity breakdown of 
there 


con- 


seem 


First, even during the “low”’ 
arousal 


to produce a 
fact, 
changes in 
heart 
rate, 


performance; in 
striking 
Accepting 


were no 
overt behavior. 
blood 
right 
(active) muscle tension as criteria, it was 


rate, pressure, 


respiration and forearm 
possible to discriminate reliably between 
the ‘“‘low”’ and “high”’ arousal conditions. 
This study that 
changes in the degree of arousal in the 
middle range of the total arousal con- 
tinuum can be differentiated effectively 
by the 
physiological functions. 


indicates, therefore, 


combined use of these four 
It is very likely 
that different criteria might have to be 
used, depending on the amount of change 
of arousal one wants to detect, and in 
arousal continuum 


which part of the 


the given changes occur. 


SUMMARY 


The chief purpose of the present study was 
to investigate the consistency of individual 
activity. In 
addition, some of the problems related to 
the measurement of arousal were also evalu- 
ated. 
corded. 


differences in physiological 


Nine physiological functions were re- 

Variations in experimental situation 
were produced by employing tracking and 
arithmetic tasks with two different 
arousal for each task 

Results showed that during qualitatively 
and quantitatively different stimulus situa- 
tions individuals exhibited idiosyncratic but 
highly stereotyped patterns of somatic and 
autonomic activation. 


levels of 
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The present results further support earlier 
work which 
singularly 
reliable indicants of 
cent of Ss 


found physiological measures 
useful in providing objective and 
Fifty-six per 

correlation 
There 
were four measures (heart rate, blood pres- 
sure, and right 


muscle tension) which consistently 


arousal 
showed significant 


among the physiological measures 
forearm 
differen- 
tiated between high and low arousal condi- 


respiration rate, 


Present evidence suggests, therefore, 
that these four measures may be maximally 
useful as indicants of arousal, at least within 
the range of arousal levels studied Although 
data from other experimental situations are 
required in order to reach a firmer conclusion 
concerning the 


tions. 


relative usefulness of these 
measures, it seems certain that several indices 
are superior to only one or two in gauging 


level of arousal. 
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CHANGES IN GSR TO A SINGLE STIMULUS AS A 
RESULT OF TRAINING ON A COMPOUND 
STIMULUS! 


WILLIAM W. GRINGS anp VSEVOLOD N. SHMELEV 


University of Southern California 


The study reported here is one of 
a series exploring relations among 
learned reactions to simple stimulus 
compounds and their elemental com- 
ponent stimuli. Classical condition- 
ing of the GSR has been used through- 
out, with colored dots (single or 
paired) as visual conditioned stimuli 
and electric shock as the UCS. The 
major relations previously studied 
included response transfer from single 
stimuli to pairs (Grings & O’Donnell, 
1956), transfer from paired stimuli 
to components, and the influence of 
preconditioning variables (Marks & 
Grings, 1958). 

The present experiment evaluates 
the assumption that training of stimu- 
lus components as members of a 
compound may alter the response 
magnitudes previously conditioned 
to the components separately. The 
specific hypothesis tested is that when 
an organism is trained to respond to 
individual component stimuli, and 
is subsequently given training on a 
compound composed of these com- 
ponents, the response evocation po- 
tentials of the components will under- 
go a change. 

It is predicted further that when 
the compound stimulus is reinforced, 
the effect on component response 
tendencies will be opposite in direc- 
tion to the effect when the compound 
stimulus is presented without rein- 
forcement. 
transfer 


On the basis of simple 


from components to com- 
' This study was supported by a grant 
(G-925) from the National Science Founda- 


tion. 


pounds it would be predicted that 
reinforcement of a compound would 
add to the response potentials of the 
components contained in the com- 
pound. On the other hand, presen- 
tational experience (nonreinforcement) 
of the compound would lead to 
extinction of the component reactions.” 

Briefly, the experimental procedure 
was as follows. GSR ‘were estab- 
lished to four visual stimuli by classi- 
cal conditioning techniques. Two of 
the stimuli were then combined in a 
compound on which further training 
was given. The other two were used 
as control stimuli. For one random 
group of Ss the training to the com- 
pound was with reinforcement present 
(hereafter called reinforcement experi- 
ence); whereas with a second group 
of Ss training to the compound was 
with reinforcement absent (hereafter 
called presentational experience). Fi- 
nally a test was administered on all 
four individual stimuli to note the 
effects of compound training. 

To the knowledge of the writers 
the only experimental evidence on 
this question comes from the labora- 
tory of Bekhterev. As reported by 
Razran (1939) this group found that 
the conditioning of a compound was 
achieved at the expense of the gradual 
elimination of component condition- 
ing, the weaker components being 


? An alternative hypothesis to those posed 
here might be that the Ss would readily dis- 
criminate between presentation of components 
and compounds, and therefore the experience 
with compounds would have no effect upon 
the response potential of the individual com- 
ponents. 
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eliminated 
Light and sound 
ployed with 
drawal to 
response. 


first, last. 
em- 
with- 


the 


the stronger 
stimuli were 
human Ss and 


electric shock was 


METHOD 


Subjects.—Forty volunteers, all under- 
graduate students of the University of South- 
ern California, were employed as Ss. Each 
was reimbursed for his participation. All had 
normal color vision as tested by the Ishihara 
test. 

A pparatus.—Conditioned stimuli were 
small colored dots produced by projection 
through Wratten filters onto a ground glass 
screen in front of S. The four colors, red, 
green, blue, and yellow, were obtained from 
standard Wratten filters and were matched 
for brightness. Time relations among the 
stimuli were controlled electronically with the 
CS preceding the UCS by .5 sec. and remaining 
on .5 sec. after the onset of the UCS. The 
UCS was a_condenser-discharge electric 
shock to the forearm. The GSR was picked 
up as a DC resistance change from the palm 
and the back of the nonpreferred hand. A 
more detailed account of the basic instru- 
mentation and laboratory situation is given in 
a previous study (Grings & O’ Donnell, 1956). 

Preliminary procedure.—The S was seated 
in front of a screen built into a partition that 
separated E and the equipment from him 
After a preliminary interval of 15 min. for 
electrode stabilization and apparatus calibra- 
tion each of the four colored stimuli was pre- 
sented to S in a random order to acquaint him 
with the stimuli. The shock was then ad- 
justed in intensity to achieve a double cri- 
terion: the consistent production of a GSR, 
and a verbal expression of slight discomfort by 
S. After three shock sensitization trials, S 
was instructed in his ‘‘task”’ for the remainder 
of the experiment. This was to depress an 
ergograph trigger with the index finger of his 
preferred hand when E said the word “ready,” 
and to keep a steady pressure on the trigger 
until the visual stimulus went out. The S 
had previously been informed that the purpose 
of the experiment was “to determine the 
nature of your skin electricity changes while 
we present colored stimuli on the screen in 
front of you.” He had also been told that the 
shock provided a “standard sized change”’ for 
use as a reference value 

Main procedure.—The major experimental 
sequence contained three parts: (1) a period 
of training on single stimuli, (2) a period of 
training on a compound or pair of stimuli, and 
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For Parts 
1 and 3 all Ss were treated alike, whereas for 
Part 2, Ss were assigned randomly into ex- 
perimental subgroups. The following dis- 
cussion of procedure details is organized ac- 
cording to the chronology of the three parts 
listed above. 

In Part 1, the period of training on single 
stimuli consisted of six presentations of each 
of four colored lights, three colors being rein- 
forced (paired with shock) and one not rein- 
forced. Presentation order was controlled by 
assigning each color randomly to two posi- 
tions in each block of eight trials. The as- 
signment of colors to reinforcement-nonrein- 
forcement conditions was varied systemati- 
cally from S to S. Except for the first 11 Ss 
the component training segment was con- 
cluded by a single presentation of each color 
without reinforcement. 

Part 2 was devoted to training on a stim- 
ulus compound composec of two of the pre- 
viously reinforced single colored dots pre- 
sented side by side as a pair. 


(3) a test series on single stimuli. 


One member of 
the pair was chosen at the outset to represent 
the effects of compound training for purposes 
of data analysis. That component was refer- 
red to as the included stimulus. The third 
reinforced component from Part 1 was not 
presented during Part 2. It was used later 
as a reference stimulus and was referred to as 
the nonincluded stimulus. The main depend- 
ent variable in the study was the difference 
between responses to the included and non- 
included stimuli during the final test segment. 
The fourth color, which had been presented 
without reinforcement during the component 
training segment, was also not presented in 
Part 2. It was used later only as a control 
stimulus to evaluate the presupposition that 
learning had occurred. 

For the assignment of compound training 
conditions to the Ss the total sample was 
divided randomly into two groups. Group I 
(reinforcement group) received eight rein- 
forced trials on the compound stimulus while 


Group II (presentational experience group) 
received eight nonreinforced trials. 
to provide data from which to plot curves of 


In order 


GSR to the included stimulus during the 
course of compound stimulus training, each 
major group was divided randomly into two 
subgroups of 10 Ss each. Subgroup A in 
each case received compound training without 
interruption, while Subgroup B had a test 
trial on the included stimulus interspersed 
between the fourth and fifth compound train- 
ing trials. 

Part 3, the final test series, included the 


presentation of each component stimulus 
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twice without reinforcement. This part 
immediately followed the compound training 
part and for purposes of analysis was broken 
up into a first posttest which consisted of the 
first presentation of all four colors in a ran- 
dom order, and a second posttest which con- 
sisted of the second presentation of the same 
stimuli in a second random order. 

At the end of the test series S was asked 
to report his reactions concerning the nature 
of the experiment. These interviews were 
either recorded on tape or summarized and 
written down by E. 


RESULTS 


The first step in the treatment of 
the data involved the transformation 
of the GSR into the unit square 
root of conductance change. This 
unit had been found in previous 
studies (Grings & O'Donnell, 1956) 
to have certain advantages. The 
major analysis to be presented here 
concerns a dependent variable which 
is the difference in response to the two 
crucial stimuli, the included stimulus 
and the nonincluded stimulus, as 
previously defined. Both of these 
stimuli were originally reinforced and 
one of them, the included one, was 
given compound training while the 
other was not. The first evaluation 


TABLE 1 


MEAN DIFFERENCES IN GSR TO INCLUDED 
AND NONINCLUDED STIMULI FOR THE Two 
CoMPOUND TRAINING GROUPS @ 


(N = 20 for each group.) 


First 
Posttest 


Second 
Group 
ties Posttest 
Reinforcement of 
compounds 


Presentation of 
compounds 


t ratio 


P 


“Unit: square root of conductance change in .01 
micromhos In each case the GSR to the reference 
(nonincluded) stimulus has been subtracted from the 
GSR to the experimental (included) stimulus 
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involved a ¢ test of the difference 
in the dependent variable due to 
the kind of compound training (rein- 
forcement vs. presentational experi- 
ence). These results are presented 
in Table 1 where it can be seen that, 
for the group with the reinforced 
compound (Group I), on the first 
test trial the response to the included 
stimulus is larger than that to the 
nonincluded stimulus (the difference 
variable is positive). With the pres- 
entation group (Group I1) the reverse 
is true—the response to the included 
stimulus is smaller than the response 
to the reference stimulus (the difference 
variable is negative). 

The group differences on the first 
test trial in Table 1 are in the direction 
that would be expected if reinforce- 
ment experience with the compound 
increased the response tendency while 
presentation experience reduced, or 
extinguished, such response tendency. 
It will be noted from the table that 
the difference between the two groups 
is significant at the 2% level (two- 
tailed test). However, it will also be 
noted that this. group difference 
reverses itself on the second test trial 
where, for both groups, the response 
to the included stimulus is less than 
that to the reference stimulus. This 
difference is not significant.’ 

The nature of these effects may be 


seen graphically by observing the 


magnitude of the experimental (in- 
cluded) stimulus throughout the com- 
pound training and posttest periods. 
Results from the subgroups for which 
a test trial was introduced midway 


during compound training are shown 


* Because the distribution of GSR 
moderately skewed even after transformation 
into square root of conductance change, all ¢ 


was 


tests were paralleled by nonparametric analy 
For within-S compari- 
sons the Wilcoxon matched-pairs signed-ranks 


ses of the same data 


test was used and for between-groups the 


Mann-Whitney 


test was employed. 
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GROUP | 
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— 


MAGNITUDE OF G.S.R. (VAT) 








TEST PERIOD 


Fic. 1. Comparison of median GSR 
magnitudes for the included stimulus at 
points: (1) after four trials on compounds; 
(2) after eight trials on compounds, or the 
first posttest trial; and (3) on the second 
posttest trial. N = 10, in each group. 


in Fig. 1. The first point indicates 
the response at the middle of com- 
pound training; the second point at 
the end of compound training (or 
the first test series); and the third 
point at the second test series. Me- 
dian values are plotted and support 
the conclusion that a difference be- 
tween groups appears at the end of 
compound training but disappears 
after one test series. In agreement 
with Table 1 the difference between 
the means on Point 2 of the figure 
is significant at the 5% level (¢ = 2.20; 
df = 18). 

* Interpretation of the above tests 
rests on two basic presuppositions: 
that Groups I and II] were not differ- 
ent at the outset of compound train- 
ing and that learning of some sort 
had occurred to the reinforced com- 
ponents during the pretraining period 
on single stimuli. To test the first 
of these, the differences between the 
GSR on the included and reference 
stimuli were computed for the test 
trial following the initial component 
conditioning. When this value 
(which is comparable to the variable 
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in Table 1) was compared for the 
two groups, no difference was found 
(t = .63). 

Similarly, to evaluate the pre- 
supposition of discrimination learning, 
a comparison was made of the rein- 
forced reference stimulus and the 
fourth stimulus component during 
the two posttest series when both 
were presented. If effects of dis- 
crimination learning persisted until 
the posttest period, the GSR to the 
reinforced reference stimulus (the so- 
called nonincluded stimulus) should 
be greater than the GSR to the fourth 
component which had been presented 
previously only during training on 
individual stimuli and had not been 


associated with shock. Groups | 


and II were combined for this com- 
parison and a significant difference 
in the expected direction (supporting 
the assumption of discrimination learn- 
ing) was found (¢ = 2.01; df = 38). 


DISCUSSION 


The analysis of results presented above 
suggests the conclusion that when stimuli 
are originally trained as individual com- 
ponents and trained as 
compounds, their response potential (as 
components) may undergo a change as a 
result of the compound training. At 
this point it appears best to conclude 
that positive transfer results from rein- 
forcement training and negative transfer 
(or extinction) from presenta- 
tional or nonreinforcement experience. 
It is also apparent that these effects, 
at least for the GSR, are not stable and 
influenced readily by extinction 
factors. 


subsequently 


results 


are 


Transfer of the type described is quite 
what one would expect from principles 
The sugges- 
tion of reversal from first to second test 
trials, appear to be at 
odds with such an explanation. Em- 
phasizing the fact that human Ss are 
involved, the writers are inclined to rely 
on factors of voluntary 


of classical conditioning. 


however, could 


response, set 
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change, or change in perception of the 
experimental With - self-in- 
structing, organisms it is 
presumed that during preliminary train- 
ing Ss were led to anticipate that certain 
colors would be followed by an electric 
shock. reinforcement training 
on compounds the anticipation of shock 
was being strengthened for the included 
stimuli, whereas during nonreinforce- 
ment training on compounds the antic- 
ipation of shock was being weakened. 
As a result, on the first test trial after 
compound training the effects of simple 
reinforcement-nonreinforcement transfer 
were evident. However, the experience 
of all stimuli without shock on the first 
test trial created in S an ambiguous 
perceptual situation for subsequent test 
trials. 

The empirical fact of rather dramatic 
change in GSR magnitude from first 
extinction trials has been 
reported before (e.g., Grant & Schiller, 
1953). The importance of verbal and 
self-instruction 


situation. 
verbalizing 


During 


to second 


factors on the learning 
and extinction of this particular response 
has also long been emphasized (Cook & 
Harris, 1937; Humphreys, 1940; Wood- 
worth, 1947). 

If the above ‘‘perceptual”’ conditions 
are true, other evidence might be ex- 
pected from the data. One such indica- 
tion was explored but led to inconclusive 
results. It employed information from 
the postexperimental interview on the 
basis of which Ss classified 
according to whether they were aware 
of stimulus relations at 


could be 


the conclusion 
of the experiment or were unaware of 
them. Assuming that unaware Ss were 
reacting less to their own verbal con- 
ception of what was going on than were 
the aware Ss, it was predicted that the 
unaware Ss reversal 
between the first and second test series 
and that such an effect should be present 


would show less 


only in Group I (since only that group 
experienced 
experience at 


a change in reinforcement 
the time of final test) 
A tabulation of data according to these 
awareness showed that the 
average change in the dependent variable 


conditions 
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from first to second test series was largest 
for the aware Ss of Group I and smallest 
for the unaware Ss of the same group. 
A Mann-Whitney test of this difference, 
however, was not significant 


SUMMARY 


A test was made of the prediction that 
when an organism is trained to individual 
component stimuli and is subsequently given 
training on a compound composed of these 
components, the poten- 
tials of the components may undergo a change 
The magnitude of this change was studied as 
a result of (a) reinforcement of the compound 
and (b) mere presentation of the compound. 

The results, with 40 Ss and GSR condi- 
visual stimuli, support the ex- 
pectation of a change in the direction of 
greater magnitude of response with reinforce- 
ment experience in the compound and less 
magnitude of response with presentational 
experience. The fact that 
appeared between first and second test series 


response evocation 


tioning to 


these results dis- 
was discussed as due to a combination of 
reinforcement and perceptual factors. 
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EFFECTS OF THREAT OF SHOCK, DISTRACTION, 
AND TASK DESIGN ON PERFORMANCE! 


ROBERT E. MURPHY? 
Johns Hopkins University 


This research is an attempt to 
integrate two different but closely 
related lines of research in psychology. 
The first covers the investigations 
of the effects of psychological stress 
on performance; the second covers 
the studies of the effects of different 
task designs on operator efficiency. 

Stress studies and task design 
studies have not usually been thought 
of within the same framework. In- 
vestigations of the effects of stress on 
performance have typically empha- 
sized the manipulation of factors ex- 
ternal to the task to be performed. 
Subjects have been threatened with 
failure, given false norms, distracted 
by all sorts of extraneous stimuli, and 


have been placed in various situations 
which have seemed to experimenters 


to be stressful. Though there are 
exceptions (Gates & Rissland, 1923; 
Hurlock, 1924; Verville, 1946), the 
great majority of the studies reviewed 
by Lazarus, Deese, and Osler (1952) 
report that performance is impaired 
as a result of psychological stress. 

In contrast to the stress studies, 
research on the effects of task designs 
on operator efficiency has been mainly 
concerned with the manipulation of 
the task itself. The critical consider- 
ation in these studies is the nature of 
the demands made upon the operator 
by the task design. For example, in 

1 This article is based on a dissertation 
submitted to Johns Hopkins University 
in partial fulfillment of the requirements for 
the Ph.D. degree. The writer is indebted to 
W. R. Garner and L. S. Otis for their many 
helpful criticisms and advice. 

2 Now at the Wright Air Development 
Center, Wright-Patterson A. F. B., Ohio. 


the design of visual displays, some of 
the important considerations seem 
to be the amount of search required 
to perceive relevant cues, the amount 
of interpolation required, the degree 
to which elements in the display 
conform to expected or learned rela- 
tionships, and the degree to which 
cues are confused. The efficiency of 
the operator varies as the number and 
complexity of the demands made 
upon him are changed. Lazarus, 
Deese, and Osler (1952) have sug- 
gested that a task itself may be 
inherently stressful due to such de- 
mands. This type of stress factor 
is not unlike that which Conrad 
(1950) has referred to as ‘‘load stress.”’ 

In line with this analysis, then, two 
types of stress may be distinguished 
on the basis of the conditions manip- 
ulated to induce them—that induced 
by varying conditions external to 
the task, and that induced by the 
task design itself. This distinction 
raises the question as to what might 
happen when these two types of stress 
are varied concomitantly. On the 
basis of previous findings, stress in- 
duced by factors external to the task 
should result in a performance decre- 
ment. One also expects differences 
in task design to produce differences 
in performance. 

The issue of primary concern in the 
present study, however, is the nature 
of the interaction of these two sources 
of stress. It seems possible that 
combining two forms of stress 
external and task induced—will pro- 
duce a combined effect which is not 
simply additive but appears in the 
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form of an interaction, such that the 
differential effect of one type of stress 
will be increased with the addition 
of the other type. This suggestion is 
supported by studies of Chapanis 
(1954), Conrad (1950), and Mack- 
worth (1952) which show that the 
effect of a given increment of stress 
is greater if it is added to a higher 
rather than a lower existing level oi 
stress. 

The two most commonly used forms 
of external stress are those which 
follow the “anxiety’”’ paradigm and 
those which follow the ‘‘distraction”’ 
paradigm. The anxiety types are 
those in which the individual antici- 
pates physical or psychological pain. 
The distraction types are those in 
which the individual is deliberately 
bothered, disturbed, or distracted dur- 
ing the performance of his task. 
Accordingly, an experiment was de- 
signed to test the effects of distraction 
and threat of electric shock on a task 
performance. Four different task de- 
signs were used. The experiment was 
designed so that the effects of distrac- 
tion and the effects of threat of shock 
could be tested alone and in combina- 
tion for each of the task designs. 


The hypotheses specific to this experi- 
ment were as follows: (a) The differ- 
ences in performance obtained with 


different task designs are greater 
under threat of shock than under non- 
threatening conditions. (6) The dif- 
ferences in performance obtained with 
different designs are greater 
under distraction than under non- 
distracting conditions. 


task 


METHOD 
General Procedure 


Subjects—The Ss were 80 male under- 
graduate students from an _ introductory 
psychology class at Johns Hopkins University. 

Task materials—The materials used in 
this experiment were four decks of cards. 


Each deck was made up of 32 square cards, 
5 in. on a side. The cards were photostatic 
duplications of range-ring patterns, covered 
with a transparent plastic which made the 
cards rigid and easy to handle. Each deck 
contained a different pattern of eight rings, 
and on each card one of the rings was marked 
with a large X. The marks were arranged 
so that each ring was marked four times in 
each deck, once in each quadrant. The 
range-rings had an outside diameter of 4.0 
in. and were separated from each other by .25 
in. The rings were solid or dashed black 
lines against a white background. The four 
different range-ring patterns are shown in 
Fig. 1. 

Preliminary trials —The S was seated at 
a table opposite EZ and was handed a pre- 
liminary deck of 32 cards. The cards in 
this preliminary deck were identical in 
every respect with the task cards, except 
that instead of a pattern of range-rings, a 
number from 1 to 8 was marked in the center 
of the card. The S was instructed to deal 
the cards from the top of the deck as rapidly 
as possible and to call off the numbers marked 
in the center of each card. At the signal, 
“ready—go,"’ S ran through the preliminary 
deck. The S was given two trials with the 
preliminary deck, and the time required to 
deal the deck on the second trial was recorded 
by E. The time score for the second pre- 
liminary trial will be referred to throughout 
this paper as S’s preliminary score. 

Immediately after the preliminary score 


Fic. 1. 


Range-ring patterns used in 
the experiment. 
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was recorded, S was handed one of the decks 
of task cards. He was instructed to deal the 
cards off the top of the deck as rapidly as 
possible and to call off the numbers of the 
marked range-rings. He was further in- 
structed to avoid making errors. Each time 
S made an error, he was stopped and his 
attention was called to the error. He was 
then permitted to proceed with the rest of the 
cards in the deck. As soon as one deck was 
completed, S was given the next deck (with a 
different pattern of rings) and so on, until 
all four task decks were run through once in 
this manner. 

Each S next ran through all four decks a 
second time, after being advised that he would 
no longer be told if he made an error. For 
this second series, the cards in each deck were 
arranged in reverse order. The E recorded 
the time required to deal each deck and the 
number of errors made. 

After completing the second preliminary 
series, each S was told that he would be re- 
quired to deal all four decks two more times 
He was asked to try to do better than the 
average score of the Ss who had dealt the 
same cards during the previous week. He 
was further told that he would be paid 5 
cents for each second that his average time 
score for the next two series bettered the 
average time score for the previous week, 
but that 5 cents would be deducted from his 
winnings for each of the average number of 
errors he made. 


Special Procedure 


At this point, each of the 80 Ss was ran- 
domly assigned to one of four experimental 
conditions so as to provide 20 Ss per condition. 
Within each condition, Ss were further ran- 
domly assigned to one of four subgroups of 5 
Ss each. The order of the presentation of 
the four decks of cards to each of these sub- 
groups within an experimental condition was 
counterbalanced. Each of the four experi- 
mental conditions consisted of a different 
combination of two levels of threat of shock 
and two levels oi distraction, as follows: 

Cond. I—No threat, No distraction; 
Cond. II—No threat, Distraction; Cond. III 

-Threat, No distraction; Cond. 1V—Threat, 
Distraction. 


Vartables 


Threat of electric shock.—There were two 
levels of this variable: (a) no threat, and (d) 
threat of an electric shock for every error 


made during the final two series. To produce 
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the threat of shock, an electrode was strapped 
about 3 in. above the left ankle of S immedi- 
ately after he had completed his last pre- 
liminary trial. He was told that he would 
receive an electric shock for every error he 
made during the final two series of task cards 
Actually, S received a mildly painful shock 
for each error during only the first of the final 
two The Ss not shocked 
errors made during the second series. 
The shock apparatus was a variable coil 
inductorium operated by a 4.5-v. DC source. 
rhe intensity of the shock at the 
average level judged to be “just painful” 
by a group of 16 Ss in a preliminary experi- 
ment. The duration of the shock was 
The same setting was used for all Ss. 
Shock for an error during the first of the 
final two series had the added effect of pro- 
viding information about errors to those Ss 
in the threat condition. As a control for the 
influence of this effect, Ss in the nonthreat 
condition were provided information about 
errors made during the first of the final two 
series by a .95-w. red light which was placed 
on the table in front of S and which flashed 
for .5 sec. every time an error was made. 
Distraction.—There two levels of 
distraction: (a) no distraction, and (b) dis- 
traction produced by presenting aurally to 
S a series of random numbers from 1 to 8. 
The random numbers recorded on a 
tape and presented to S through a set of 
earphones in a moderately loud conversa- 
tional tone (approximately 75-80 db above 
threshold After S had completed his last 
preliminary trial, the earphones were adjusted 
on him, and he was given the final two series. 
The recorder was turned on about 3 sec. be- 
fore S was given the signal to begin dealing 
a deck of cards, and it was turned off shortly 
after each deck had been completed. 


series. were for 


was set 


5 sec 


were 


were 


Task design.—There were four task de 
signs. These are represented by the four 
range-ring patterns shown in Fig. 1. Garner, 
Saltzman, and Saltzman (1949) have pre- 
viously shown that these four patterns 


produce significant differences in performance. 

Criteria of performance.—Two criteria were 
used: (a) time to deal a deck of 32 cards, and 
(b) number of errors made. 


Summary of Experimental Design 
rhe 


essentially 


design used in this experiment is 
Lindquist’s (1953) Type III de- 
Eighty Ss dealt all four decks of cards 
There were four combinations of threat and 
distraction with 20 Ss in each 


combination. The design also provided for 


sign. 


conditions 











TABLE 1 


PRELIMINARY SCORES FOR SS ASSIGNED 
ro Eacuh ConDITION 





Condition \ Mean SD 
No threat-no distraction 20 | 20.44 | 2.18 
No threat-distraction 20 | 18.81 | 2.17 
Threat-no distraction 20 | 18.83 | 3.45 
Threat-distraction 20 | 19.12 | 3.07 


counterbalancing order in each of the four 
combinations of 
and 


threat and distraction con- 


ditions for determining the effects of 


order. 
) 
RESULTS 


The time, in seconds, required by 


each S to deal each deck of cards 
during the second of the final two 
series is used in the analysis. In 


order to reduce the variability be- 
tween Ss attributable to differences 
in the sheer mechanics of dealing a 
deck of cards, the time required by 
each S to deal the preliminary deck 
(preliminary 
from each of 


score) was subtracted 
his time scores in the 
final series to provide an adjusted time 
score. The procedure of subtracting 
the preliminary score from S’s final 
time score and performing the analy- 
sis on these adjusted scores had the 
effect of a covariance analysis 
of the final time with time 


to deal the preliminary deck as the 


scores 
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covariable. In fact, such an analysis 
of covariance of part of the data gave 
results identical to the straightforward 
analysis of variance of the adjusted 
time scores. 

The means and SD of these pre- 
liminary scores for Ss in each of the 
four experimental conditions are given 
in Table 1. It can be seen from this 
table that the four experimental 
groups did not differ appreciably on 
these preliminary scores. 

The means and SDs of the adjusted 
time scores for each range-ring pattern 
under each of the four experimental 
conditions are given in Table 2. A 
summary of the analysis of variance 
of the adjusted time scores is given 
in Table 3. Since the effects of order 
and the interactions of order with 
the other variables proved to be not 
significant, order effects are not broker: 
down in Table 3. 
are contained in 
within conditions” 


All effects of order 
the two “groups 
terms in Table 3. 

As can be seen from the row means 
in Table 2, there is a marked per- 


formance decrement under threat 
of shock for errors. There is a 
difference of 7.16 sec. between the 
means of the threat and nonthreat 


conditions which is highly significant 
(P < .001). 
The over-all effects of distraction 


TABLE 2 


\DJUSTED TIM} 


SCORES FOR EACH PATTERN UNDER EACH OF THI 


Four EXPERIMENTAL CONDITIONS 


Range-Ring Pattern 


Rows 
( \ \ I ( Dd 

Mean SD Mear SD Mear SD Mean SD Mean Sp 
No threat-no distraction! 20 | 9.12 | 3.70| 11.09 | 3.57 | 12.35 | 3.34! 12.75 | 4.31 | 11.33 | 4.00 
No threat-distraction 20 | 12.40 | 4.66 | 13.12 | 3.99 | 14.79 | 4.90 | 15.25 | 4.39 | 13.89 | 4.65 
Chreat-no distraction 20 | 17.62 | 7.87 | 21.10) 8.15 | 20.11 | 7.74 | 23.60 | 9.14 | 20.61 | 8.52 
Chreat-distraction 20 | 16.34 | 5.60 | 18.51 | 5.57 | 19.04 | 6.22 | 21.84 | 6.03 18.93 | 6.18 

Columns 80 | 13.87 | 6.59 | 15.96 | 6.91 | 16.57 | 6.58 | 18.36! 7.72 
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TABLE 3 


ANALYSIS OF VARIANCE OF THE CORRECTED 
Time SCORES 


Source df | MS F 

Design patterns (P) 3| 274.27 | 57.38* 
Shock threat (Sh) 1/4104.58 | 31.92* 
Distraction (D) 1} 16.05) .12 
P X Sh 3} 33.00 | 6.90* 
P XD | 3) 6.05 | 1.27 
D X Sh | 1) 359.02 | 2.79 
P XD X Sh | 3} 1.29] °.27 
Groups within condi- | 

tions | 12) 134.84 | 1.05 
P X Groups within 

conditions | 36 4.49 | 94 
Ss within groups | 64) 128.57 | 
P X Ss within groups | 192 4.78 | 

Total | al 

*P <.00l. 


proved to be not significant, there 
being a difference of only .44 sec. 
between the means of the distraction 
and nondistraction conditions. How- 
ever, it will be noted in the row means 
of Table 2 that distraction affected 
performance differently in the threat 
and nonthreat conditions. In the 
nonthreat condition, distraction pro- 
duced a performance decrement which 
is indicated by an increase of 2.56 
sec. in the mean score. I[n the threat 
condition, however, distraction pro- 
duced a performance increment which 
is indicated by a decrease of 1.68 sec. 
in the mean score. In the analysis 
(Table 3) this interaction of the 
Distraction and Threat conditions is 
not significant (P = .10). 

The over-all effect of the different 
range-ring patterns is highly signifi- 
cant. The ¢ tests indicate a differ- 
ence between all adjacent patterns 
beyond the .001 level of significance, 
except for Patterns B and C, where 
05 < P < 10. The critical consid- 
eration for the hypotheses investi- 
gated in the present experiment is 
whether“or not the differences due to 
patterns are the same for the four 
experimental conditions. It can be 
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seen from Table 2 that these differ- 
ences between pattern means are, 
indeed, not the same for all experi- 
mental conditions. The effect of 
threat of shock was to increase the 
differences between pattern means. 
For example, the difference between 
the means of the best (A) and the 
worst (D) patterns, in terms of per- 
formance, under the nonthreat condi- 
tions is 3.24 sec., while the difference 


between the means of these same 
patterns under threat of shock is 
5.74 sec. It should be pointed out 


that, under the threat of shock condi- 
tion, there is an increase in the differ- 
ences between the means of all 
adjacent patterns except for Patterns 
B and C where there is a decrease 
in the difference. The over-all inter- 
action of patterns and threat of 
shock, as shown in Table 3, is signifi- 
cant beyond the .001 level. For all 
adjacent patterns, except B and C, 
this interaction is significant beyond 
the .01 level. 

Distraction, on the other hand, had 
no significant effect on the differences 
between pattern means, as indicated 
by the nonsignificant interaction be- 
tween patterns and distraction shown 
in Table 3. 

As might be expected, Ss made 
occasional errors, even though they 
were instructed to avoid doing so. 
The total number of errors made with 


TABLE 4 
ToraL Errors FOR EACH PATTERN UNDER 


Eacu ConpDITION 


Range-Ring 


Pattern 
Conditions N Total 
A'B\|C D 
No threat-no distraction 20; 4) 16) 16) 27 63 
No threat-distraction 20| 13) 7) 13) 20 53 
Threat-no distraction 20; 3) 9 1) 2 15 
Threat-distraction 20; 2) 2) 2) § i1 
Total 80) 22) 34) 32) 54 142 
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each range-ring pattern for each con- 
dition is given in Table 4. As seen 
in this table, the total number of 
errors is small, representing less than 
1.4 errors for each 100 identifications. 
The only consistent effect is the 
decrease in the number of errors 
under threat of shock. Also, while 
there is some tendency for more 
errors to occur with the poorer 
patterns in the nonthreat condition, 
the errors are fairly constant from 
pattern to pattern under threat of 
shock for errors. 


DISCUSSION 


The significant main effects of range- 
ring patterns and threat of shock on 
performance were, of course, expected 
and serve only to confirm similar findings 
already reported in the literature. The 
effects of distraction, however, were not 
anticipated. The simple effects of dis- 
traction on performance were different 
under the threat and nonthreat condi- 
tions. In fact, there is an inverse 
relationship between these simple effects 
which accounts for the nonsignificance 
of the main effect of distraction in this 
experiment. In the nonthreat condi- 
tion, distraction produced an over-all 
performance decrement; in the threat 
condition, however, the introduction of 
distraction was accompanied by a per- 
formance increment. Though the data 
reported here do not demonstrate this 
interaction of distraction and threat at 
better than the .10 level of significance, 
related effects which have 
been reported by other Es suggest that 
this interaction is probably a real effect 
rather than a chance occurrence. Auble 
and Britton (1958) found that a high anx- 
iety group performed significantly better 
under a noise distraction condition than 
under quiet while a low 
anxiety group performed better under 
quiet conditions. Angelino and Mech 
(1955) reported a similar inverse re- 
lationship between adjustment and per- 
formance under a _ noise distraction 
condition. The results of the present 


distraction 


conditions 
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experiment suggest that distraction may 
have functioned in the threat condition 
so as to reduce the effects of threat. A 
possible explanation for this interaction 
effect is suggested by Lazarus, Deese, 
and Osler (1952). They suggest that 
the compelling preoccupation with con- 
sequences of failure may be disrupting 
to performance, especially on tasks 
requiring concentration. If we can as- 
sume that preoccupation with the con- 
sequences of making an error in this 
experiment (i.e., receiving a shock) was 
disruptive to performance, we may go 
a step further and suggest that the 
addition of distracting stimuli served to 
distract from this preoccupation and 
thereby reduce the effects of threat of 
shock. 

The highly significant interaction be- 
tween patterns and threat of shock, 
supports the hypothesis that the differ- 
ences in performance obtained with 
different task designs are greater under 
threat of shock than under nonthreaten- 
ing conditions. It is, of course, not 
surprising that the time scores should 
increase under the condition of threat 
of shock for errors. The important 
point is that there is a differential in- 
crease in time depending on the ring 
pattern employed and that the increase 
in time is greater for the worst pattern 
than for the best pattern. If we may 
assume, as was suggested earlier, that 
some task designs are inherently more 
stressful than others, then we may re- 
gard the differential effect of threat 
on the performance obtained with the 
different ring patterns used in this study 
as the result of the effects 
of two sources of stress. In other words, 
the increment of stress provided by 
threat of shock has more effect on per- 
formance for the pattern whose inherent 
level of stress is greater. 


combined 


The sugges- 
tion of Lazarus, Deese, and Osler (1953), 
which was mentioned earlier, seems to 
apply here as well. They suggest that 
the compelling preoccupation with con- 
sequences of failure may be disrupting 
for tasks requiring concentration while 
having lesser effect on tasks demanding 


little or no concentration. The findings 
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of the present experiment are consistent 
with this suggestion, since Ss’ comments 
would indicate that those in the threat 
group were, indeed, preoccupied with the 
consequences of making an error, and it 
seems reasonable that the more demand- 
ing patterns require more concentration. 

A comparison of Tables 2 and 4 
suggests a further explanation for the 
interaction obtained patterns 
and the threat variable. In the 
threat condition performance became 
progressively poorer from Patterns A to 
D as measured by either time or error 
However, under threat of shock, 
errors are not only markedly reduced 
but they are fairly constant from pattern 
to pattern. 
the situation where errors do not appear 


between 
non- 


scores. 


It seems, therefore, that in 


to be ‘‘costly,’’ some Ss may be willing 
to tolerate a certain probability of error 
in order to save time. Such errors are 
more with the more difficult 
patterns. However, when errors 
made ‘“‘costly,”” Ss attempt to 
errors even at the expense of greatly 
increased time. This the effect 
of stabilizing errors at a minimum level. 
Since without stress, the probability of 
making an error is greater on the more 
difficult patterns, the nearly complete 
reduction of errors should increase the 
time scores more for the more difficult 
patterns. 

The 


ences in 


evident 
are 
avoid 


has 


that 
obtained 


second hypothesis, differ- 
performance with 
different task designs are greater under 
distraction than nondistracting 
conditions, is not supported by the find- 
ings. The interaction 
between patterns and distraction indi 
cates that distraction, as induced in this 
experiment, failed to influence differ- 
ences in performance obtained with 
different Since this hypothe- 
sis seemed reasonable to begin with, one 


under 


absence of an 


patterns. 


might speculate as to why it was not 


supported by these findings. Perhaps 


distraction was regarded by Ss as ‘“‘some- 
thing to beat,’’ and accordingly 
have affected the level of effort of the 


may 


distraction group. Others (Lazarusetal., 
1953) have pointed out that the effects 
of distraction in laboratory experiments 
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are apt to be confounded with the effects 
of motivation. Perhaps an _ increase 
in level of effort had the effect of de- 
creasing the differences in performance 
obtained with different task designs. 
An increase in level of effort, then, might 
very well have masked the effects of 
distraction on the differences in perform- 
ance obtained with the range-ring pat- 
terns in the present experiment. 

In addition to the above considera- 
tions, the results of this experiment have 
implications for other types of psycho- 
logical research. It shows that differ- 
ences in task design may, in fact, be 
much more effective in producing differ- 
ences in performance than one would be 
led to expect from the usual experiment. 
Many tasks are in fact performed under 
conditions of threat or ‘‘anxiety’’— 
certainly this is true for most military 
tasks. Under conditions it is 
reasonable to expect task design to be 
a more critical factor often 


these 


than is 
assumed. 


SUMMARY 


The purpose was to investigate the effects 
of threat of electric shock, aural distraction, 
and task design on performance. Eighty 
male Ss dealt four decks of 32 cards, each 
deck consisting of a different design pattern 
of eight range-rings. The time to deal each 
deck and the number of errors made in iden- 
tifying a marked ring on each card were 
recorded I'wenty Ss were assigned to one 
of four experimental groups which involved 
different combinations of two levels of threat 
and two distraction Threat was 
induced by attaching an electrode to S and 
informing him that he would be shocked for 
each error. Distraction produced by 
series of random numbers 
aurally to S through a set of earphones 

Performance differences obtained with the 
different range-ring patterns and measured 
in terms of time to deal a deck of cards were 
greater under the threat condition than 
under the nonthreat condition. Distraction 
had no significant effect on the differences 
in performance obtained with the different 
range-ring patterns 


levels of 


was 
presenting a 


indication (P = .10) of 
an interaction between threat and distraction 
conditions. In the nonthreat condition, 
distraction produced a performance decre- 
ment, while in the threat condition, distrac- 


There was some 
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tion was 


increment. 


accompanied by a_ performance 

The total number of errors amounted to 
less than 1.4 errors for each 100 identifica- 
tions. However, there were significantly 
fewer errors in the threat condition than in 
the nonthreat condition. In the nonthreat 
make 
more errors with the poorer range-ring pat- 
terns. In the threat condition, the errors 
were fairly constant from pattern to pattern. 


condition, there was a tendency to 
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SOME EFFECTS OF 


INTRODUCING 


AND WITHDRAWING 


KNOWLEDGE OF RESULTS EARLY AND 
PRACTICE! 
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LATE IN 


Tulane University 


An earlier study by Bilodeau and 
Bilodeau (1958) reported the effect 
of providing schedules of knowledge 
of results (KR) according to fixed 
ratio. It was shown that learning 
depended upon the absolute frequency 
of KR, but not upon the relative fre- 
quency. That is, whatever the dis- 
tribution of KR, non-KR trials neither 
hindered nor facilitated the learning 
produced by KR trials. The present 
study also deals with the relevance of 
KR for learning, but here KR and 
non-KR trials were administered in 
blocks. Such blocks were introduced 
either early or late in practice. 

Because of the results of the pre- 
vious study, it was expected that per- 
formance would improve with KR and 
deteriorate after the withdrawal of 
KR. The major expectation, how- 
ever, was that non-KR trials would 
have no effect on learning when KR 
was eventually introduced. Actually, 
the issues involved here are analogous 
to those of the latent learning, reward 
context, and the experimental design 
was structured accordingly. In the 
latter part of the experiment all Ss 
were tested with KR. Prior to this, 
one group practiced without KR, 
another with KR, and two others with 
a different number of KR trials 
followed by trials without KR. 

A secondary purpose of the experi- 

1 The data were collected at the Air Force 
Personnel and Training Research Center, 
San Antonio, Texas. The costs of preparing 
and publishing the manuscript were borne by 


the Graduate Research Council of Tulane 
University. 
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ment was to provide observations on 
the rate and level of response deteri- 
oration as KR was withdrawn from Ss 
of diverse proficiency. 


METHOD 


Subjects and apparatus.—One hundred and 
sixty naive male Ss were assigned in equal 
number and without known bias to each of 
four conditions of practice on the Manual 
Lever apparatus. A complete description of 
the device was published by Bilodeau and 
Ferguson (1953). 

The S's task was to learn to displace the 
lever by a certain amount. Unknown to S, 
a displacement of 33° of arc constituted a 
perfect pull of the lever. Whenever KR was 
given, it followed the pull by about 5 sec.; 
“10 units high” and “8 units low” are ex- 
amples of KR. The task, then, was to learn 
to pull the lever by an amount which would 
minimize the error reported verbally by E. 

A 20-lb. pull was required to move the 
lever. Thus, proprioceptive cues were strong 
as compared with a line drawing task. 

Procedure.—For each of four groups, 24 
pulls were required. The last five pulls were 
always followed by KR. For Groups 0, 2, 6, 
and 19, the first 0, 2, 6, and 19 trials, respec- 
tively, were also with KR. 
trials were not. 

All 24 trials were run with a 20-sec. trial 
cycle. A “ready” signal preceded the “‘pull’”’ 
signal by 3 sec. The response was made 
within 3-4 sec., and KR was given 5 sec. after 
response completion. About 8 sec. remained 
until the next warning signal. The S released 
the lever on completion of the pull, returned 
his arm to his side and waited for the next 
ready signal. 


The remaining 


RESULTs ? 


The mean absolute errors (actual 
response less 33°) are plotted in Fig. 1. 


2 Two tables of data have been deposited 
with the American Documentation Institute. 
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Mean absolute error in positioning a Manual Lever to 33° on successive trials 
with and without KR (see text) 


The figure is divided into two inde- 
pendent parts. In the first of these 
parts, the curve labeled Control Com- 
parison shows the effects of 19 suc- 
KRs. The Control Com- 
parison represents the errors of three 


cessive 


groups (2, 6, and 19) averaged when- 
ever KR effect. The other 
curves on the left side of Fig. 1 show 
no particular trend in the error when 
KR is entirely absent (see 0), and 
increasing error trends after KR is 
withdrawn and 6). The in- 
creasing error trends of Groups 2 and 6 
marked, though they do not 
suggest an imminent regression to the 
level of the initial error. The per- 
formance of Group 6 certainly deteri- 


was in 


(see 2 


are 
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orates no more slowly than that of 
Group 2 despite the fact that Group 
6 has had four more KRs. Groups 
2 and 6 do not appear sufficient for a 
firm conclusion on deterioration as a 
function of the number of previous 
KRs. A group with KR withdrawn 
after 20 or so trials would have been 
a useful addition to the experimental 
design. 

The second part of Fig. 1 presents 
a test of the latent effect of repeated 
responding in the absence of KR. 
Here, the last five trials of Group 0 
represent the first trials with KR, 
after a of 19 trials without 
KR. Group 19 served as a control, 
since their first five KR trials pro- 
vided their first experience with the 
apparatus. The 
neither Groups nor Groups by Trials 


series 
mean squares of 


reached the .05 level of significance. 
Thus, introducing KR immediately 





144 E. A. BILODEAU, I. 
or late in practice had no differential 
effect upon level of performance and 
rate of learning, i.e., there is no 
demonstrable latent effect. 

The mean algebraic errors were also 
examined. These were near 
zero for Group 19. For the experi- 
mental groups, the errors were near 
zero during KR trials and markedly 
positive in sign in the absence of KR. 
Not only did Ss overshoot when KR 
was omitted, but they also overshot 
by amounts that increased steadily 
the longer KR had been absent. 


errors 


DISCUSSION 


The 


previous study 


with a 
was given 
There was 
(a) no improvement without KR, (0) 
progressive improvement with KR, and 
(c) deterioration of response proficiency 
after the withdrawal of KR. The curves 
of Fig. 1 effectively summarized the clas- 
sical properties of KR. Furthermore, it 
was shown that an initial series of trials 
without KR did not serve to acquaint 
S with properties of the lever system 
that could be gainfully employed (pos- 
itive transfer) with the introduction of 
KR after the twentieth response. How- 
and lamentably so, a failure to 
produce latent learning is not to say 
that latent learning cannot be produced 
(though we may not know how). 

A few KR trials (2 or 6) introduced 
at the very beginning of practice were 
effective in producing relatively long- 
term response modification, though re- 
sponse deterioration did set in immedi- 
ately upon the withdrawal of KR. The 
deterioration, marked in quantity and 
positive in sign, nonetheless remained 
substantially below the level of errors 
produced when KR was entirely absent. 
The positive effect of a single KR trial 
is probably much greater (and more 
lasting) than the negative effect of a 
non-KR trial. 

Dees and Grindley (1951), and others, 
have also observed increasing amplitude 
of response upon withdrawal of KR. 
A naive application of Hull’s Jz theory 


consistent 
KR 


ratios. 


results were 
where 


according to fixed 


ever, 
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(1943) to deterioration phenomena would 
predict decreasing amplitude of response, 
e.g., progressive undershooting would 
follow the build-up of Jr. Perhaps, it 
would be better to state that Jz interacts 
with or damps proprioceptive feedback. 
Thus, with the accumulation of Jp, 
responses must progressively lengthen 
during non-KR trials in order to produce 
internal feedback sufficient on Trial m 
to match feedback on Trial — 1. The 
assumption here is that S is matching 
proprioceptive trials 
without KR. If this were done without 


feedbacks during 


the accumulation of Jp, a series of re- 
sponses would be of uniform amplitude. 
With increasing Jr the amplitude of the 
response might well rise in compensation 
for a decreasing proprioceptive feedback. 


SUMMARY 


This experiment deals with the late intro- 
duction and removal of knowledge of results 
Its purpose was, by showing in a single experi- 
ment the effects of basic manipulations, to 
reassert the powerful relationships between 
response and the presence or absence of KR 
The task was lever-displacing; the stimulus 
(KR) was the amount and direction of the 
reported error. 

The experiment showed (a) no improve- 
ment without KR, (b) progressive improve- 
ment with KR, and (c) response deterioration 
after the withdrawal of KR. Further, an 
early series of trials without KR had no 
latent effect on the learning shown when KR 
was eventually introduced. 


REFERENCES 


BiLopeau, E. A., & BiLtopgeau, I. McD 
Variable frequency of knowledge of results 
and the learning of a simple skill. J. exp. 
Psychol., 1958, 55, 379-383. 

BiLopEAU, E. A., & FERGuson, T. G. A 
device for presenting knowledge of results 
as a variable function of the magnitude of 
the response. Amer. J. Psychol., 1953, 66, 
483-487. 

Dees, V., & GrinDLEY, G. C. The effect of 
knowledge of results on learning and per- 
formance: IV. The direction of the error 
in very simple skills. Quart. J. exp. Psy- 
chol., 1951, 3, 36-42. 

Hutt, C. L. Principles of behavior 
York: Appleton-Century, 1943. 


New 


(Received August 21, 1958) 





Journal of Experimental Psychology 
Vol. 58, No. 2, 1959 


DIFFERENTIAL EFFECTS O 


fF 


CONTINUOUS EXTINCTION 


AND DISCRIMINATION TRAINING ON THE 
GENERALIZATION GRADIENT ! 


WERNER K 


HONIG 


Denison University 


DAVID R 


FTHOMAS? anp NORMAN Gl 


rTMAN 


Duke University 


A basic assumption in the tradi- 
tional stimulus-response analysis of 
discrimination learning is that excita- 
tory and inhibitory response tenden- 
cies are independently developed to 
the positive (St) and negative (S~) 
stimuli as a result the action of 
reinforcement and nonreinforcement. 
The only “interaction” occurring be- 
tween these stimuli results from the 
generation of gradients of excitation 
and inhibitation which have S* and 
S~ as their respective maxima, and 
which summate algebraically on the 
stimulus dimension on which S* and 
S~ are differentiated. This analysis, 
first proposed by Spence (1937) to 
account for transposition, permits the 
prediction that after discrimination 
training, the resulting generalization 
gradient has its maximum not at S* 
but at a previously neutral stimulus 
value displaced away from the S 


of 


lo support the assumption of gradients of 
excitation and inhibition, Spence cites the 
work of Pavlov (1927) and of Bass and Hull 
(1934) in the area of classical conditioning 
More recently, Guttman and Kalish (1956 
have made a reliable determination of gener- 
alization gradients along the spectral dimen- 
sion in an operant conditioning situation 

! This study was supported by Grant MH- 
1002 from the National Institute of Mental 
Health, United States Public Health Service 
The authors are indebted to Deborah Nicker- 
son and Robert Langston for assistance in 
running the Ss. A modified version of this 
paper was presented at the meetings of APA, 
New York, 1957. 

2 Now at Kent State University 
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Honig (1956) was able to establish a gradient 
of extinction after training response to 
a number of stimulus values and extinguishing 
one value. In unpublished research, 
Jenkins* obtained a gradient of extinction on 
the dimension of auditory frequency, after 
training a discrimination where silence served 
S* and a tone as S These experiments 
give support to Spence’s assumptions about 
the existence of gradients of excitation and 
inhibition, although the forms of the gradients 
are not always similar to those postulated 
The question remains whether the achieve- 
ment of a discrimination is merely an additive 


on 


as 


combination of the processes of exc itation and 
inhibition at different points on the stimulus 
continuum, or whether some more complex 
interaction ts involved 

Hanson (1956, 1957 


spectral generalization 


has investigated the 
the 
establishment of a successive discrimination 
He did indeed obtain a displacement of the 
maximal value, or “‘peak shift,”’ in the dire 

tion predicted by Spence 


gradient after 


However, he found 
considerable difficulty in attempting to derive 
his empirical gradients from the summation 
of postulated excitatory and inhibitory gra 
dients. His discussion is too complex to be 
reviewed here, but one point is particularly 
relevant to the present study: any postulated 
inhibitory gradients accounting for his data 
would have to have maxima at the S* 
than the S 
certainly 


rather 
a determination that 
contrary to analysis 
Hanson concludes that the best description 
of the effect of discrimination 
“relocation” of the primary 

gradient along the stimulus continuum 


value, 1s 


pence s 
training is a 
generalization 
The 
simple summation of independently generated 
gradients of excitation and inhibition cannot 
account for this 


“relocation”; the develop 


ment of a discrimination appears to involve 


additional processes that are sui generis, and 


* Personal communication 
Jenkins, August, 1958 


from Herbert 


M 
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not accounted for in terms of the combination 
of effects of simple acquisition and extinction. 


This problem may be approached 
in another way: If the additive effects 
of excitation and inhibition are re- 
sponsible for the effects of discrimina- 
tion training, extinction at some point 
on the generalization gradient should 
have the same effect on the gradient as 
discrimination training. While there 
may be some differences, particularly 
in over-all response level, due to the 
continued presentation of reinforce- 
ments in discrimination 
training, some tendency to a peak 
shift should be observed in both cases. 


successive 


The present experiment was designed 
to test this hypothesis. After being 
trained to respond to one stimulus 
(S®), pigeons were given massed, or 
continuous extinction at another value 
(S®) lying on the generalization gradi- 
ent. The effects of the extinction could 
be estimated by comparing the subse- 
quent generalization gradient with 
a control gradient obtained when no 
extinction was given. Ifa gradient of 
extinction is generated after continu- 
extinction at S®, it should be 
After 
the postextinction gradient was ob- 
tained, the Ss were given successive 
discrimination training with S® and 
S® serving as S* and S~, respectively, 
so that the effects of this training could 
be compared with those of continuous 
extinction. 


ous 


observed in such a comparison. 


METHOD 


The study was carried out in two separate 
experiments identical in procedure but differ- 
ing slightly with regard to breed of Ss and 
brightness equalization of the stimuli. 
these differences appear to be responsible for 
differences in height and form of the generali- 
zation gradients, the data from the two experi 
ments have not been combined, although they 
are presented and discussed together. 

Subjects. 


Since 


Eighteen homing pigeons were 
used in Exp. |. Twenty-two White Carneau 
pigeons were used in Exp. II. 
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Apparatus—An automatic key-pecking 
apparatus similar to that described by Gutt- 
man and Kalish (1956) was used in both 
experiments. The light source for the il- 
lumination of the key was a Bausch and Lomb 
monochromator, which employs a diffraction 
grating for the dispersion of white light into 
the spectral continuum. The band width of 
the spectral stimuli used in the present study 
was about 16 my. In Exp. I, the stimuli were 
equated for apparent brightness by means of 
a neutral density optical wedge. A human 
observer made the equations with the aid of a 
Macbeth illuminometer. Blough’s finding 
(1957) that the pigeon’s luminosity function 
is similar to that of man provides justification 
for this procedure. In Exp. II, brightness 
values were not equated. 

Procedure.—Preliminary and variable in- 
terval training: The Ss were reduced by 
starvation to between 75 and 80% of their 
free-feeding body weight. They were maga- 
zine-trained and conditioned by 
approximations to peck at the key illuminated 
bya light of 550 mp. After one session of 50 
continuous reinforcements, variable interval 
(VI) training was begun. VI training ex- 
tended over 10 daily sessions, each consisting 
of 30, 1-min. “stimulus-on” periods in which 
the 550 my light was presented alternately 
with 10-sec. periods of blackout. The VI 
schedule had a mean inter-reinforcement inter- 
val of about 1 min. The programmer did not 
operate during the blackout periods, and no 
reinforcements could be obtained at that time 

Continuous extinction and the first gen- 
eralization test: On the day following the last 
VI session, continuous extinction on 570 mu 
was given by presenting that stimulus without 
reinforcement for 1-min. periods separated by 
10-sec. blackout intervals. Group 1 (six Ss, 
Exp. I; eight Ss, Exp. II) received 20 min 
of extinction, while Group 2 (six Ss, Exp. I; 
eight Ss, Exp. II) received 40 min. The 
control Group (six Ss, both experiments) 
On the following day, 
each S was tested for generalization in extinc- 
tion. Eleven stimuli were used, ranging from 
490 to 610 my in 10 may intervals, with the 
omission of 500 and 600 my. Each test 
consisted of 12 blocks of stimulus presenta- 
tions in which each stimulus occurred once. 
The 11 stimuli were presented in a random 
order within each block. Each presentation 
was for 30 sec. and was followed by a 10-se 
blackout period to allow for the changing of 
the stimulus and the recording of data. Since 
the control group received no extinction, its 
generalization test was given on the day 
following the completion of VI training 


successive 


received no extinction. 
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Discrimination training and the second 
generalization test: On the day after the first 
test, each S from Groups 1 and 2 received 10 
continuous reinforcements with the 550 my 
stimulus in order to reinstate the conditioned 
operant. Successive discrimination training 
started immediately thereafter. The latter 
was carried out for 80 min. in all: 30 min. on 
Days 1 and 2, and 20 min. on Day 3. Re- 
sponding to 550 my, the positive stimulus, 
was reinforced according to the VI schedule 
described Responding to 570 mu, 
the negative stimulus, was never reinforced. 
Each stimulus was presented for 1 min. 
periods followed by 10 sec. of blackout. 
Within each block of 10 periods, St and S 
were each presented five times. The order 
of presentation was randomized from block 
to block Gellerman series 
(Stevens, 1951) no stimulus being presented 
for more than three successive periods. One 
S from Group 2 of Exp. II died before the 
completion of this phase of the research, 
reducing the N for that group to five. 

Unlike the proce- 
dure, successive discrimination training was 
carried out for an equal amount of time for 
all Ss, with 40 min. of presentation of each 
stimulus. A few Ss _ stopped responding 
entirely during part of the scheduled dis- 
crimination training procedure. If the cessa- 
tion of responding continued for 10 or more 
successive periods, these presentations were 
repeated at the end of the third training 
session in order to insure that all Ss received 
equal amounts of “‘active’’ training. 


above 


according to a 


continuous extinction 


RESULTS 


The effects of continuous extinction. 
The postextinction gradients (PEGs) 
and control gradients are presented 
in Fig. 1 in terms of the mean number 
of responses obtained at each stimulus 
value on the first generalization test. 
In order to estimate the effects of the 
extinction procedure, the PEGs have 
also been subtracted from the control 
gradients, the differences being shown 
below the line of zero response level 
by curves which may be called “gra- 
dients of decrement.’’ From inspec- 
tion of Fig. 1 it is apparent that (a) 
the PEGs from Exp. I are very 
similar, (6) they are considerably 
lower than the control gradients, (c) 
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Fic. 1. Postextinction and control gradients 
from Exp. I (above) and Exp. II (below 


the reduction indicated by the gra- 
dient of decrement is a general one 
and appears to be roughly propor- 
tional to the height of the control 
gradient at each point. The decre- 
ment is not centered around 570 my; 
it is smaller, in fact, at that point 
than at 550 and 560 mu, and, for the 
40-min. group, at 580 my. In Exp. 
Il there appears to be a difference in 
the form of the PEGs, that for the 
20-min. group being somewhat steeper. 
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This is supported by a_ two-factor 
mixed design analysis of variance 
(Lindquist Type I, 1953) on the 
gradients from the 20- and 40-min. 
groups, where wave length serves as 
the ‘‘within-Ss”’ variable, and extinc- 
tion time as the ‘‘between- Ss”’ variable. 
Asignificant interaction between treat- 
ment variables indicates significant 
slope differences (F = 2.2; 10, 125 df, 
P < .05), although the difference in 
over-all response level is not signifi- 
cant. Since the assumption of homo- 
geneity of variance at different stimu- 
lus values is of dubious validity, this 
analysis should be interpreted with 
considerable caution. The gradient 
of decrement for the 40-min. group 
again shows a general reduction in 
response strength, with the maximal 
reduction clearly at 550 mug. For the 
20-min. group, the reduction is maxi- 
mal at 570 my and decreases to either 
side of this value. This is the only 
one of the four gradients of decrement 
which actually appears to be centered 
around S¥ rather than S®*. 

There is considerable evidence 
(Guttman & Kalish, 1956; Kalish & 
Guttman, 1957) that gradients differ- 
ing in over-all response level but other- 
wise obtained under similar cond'tions 
are multiplicatively related. If the 
effect of continuous extinction repre- 
sents no more than a general reduction 
in response level, a multiplicative rela- 
tionship should be observed between 
the PEGs and the respective control 
gradients. In such a case, the per- 
centage obtained at each stimulus 
of the total number of responses from 
the entire gradient should be identical 
for the PEGs and control gradients. 
These percentages are plotted in Fig. 
2. It is clear that in Exp. I they are 
similar for ali groups. In Exp. II, 


on the other hand, there appear to 


be systematic differences between the 
percentage gradients. That of the 
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Fic. 2. Postextinction and control gradi- 
ents from Exp. I (above) and Exp. II (below) 
plotted in terms of percentage of total re- 
sponses at each stimulus value 


40-min. group is flatter than that of 
the control group, while that of the 
20-min. group shows a higher per- 
centage at 540 and 550 my, and a 
lower percentage from 560 to 590 mu, 
with the largest difference occurring 
at the S®. An analysis of variance 
similar to that described above on 
the three percentage gradients from 
Exp. II shows a significant interac- 
tion between the treatment variables 
(F = 2.94; 20, 190 df, P < .01). 
Since the slope differences between the 
gradients of the 20-min. group and 
the control group are of particular 
interest, a separate analysis was done 
with these two gradients. No sig- 
nificant interaction was obtained, 
(F = 1.36; 10, 120 df) so that even 
the apparently specific effect in the 
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TABLE 1 


MEANS OF THE PEGs ANnp PDGs WITH THE 
DIFFERENCE BETWEEN EACH EXPERI- 
MENTAL GRADIENT MEAN AND 
THE RESPECTIVE CONTROL 
GRADIENT MEAN 


Postextinction Postdiscrimination 


Group Exp Exp 
Mean Control Mean Control 
PEG Differ PDG Differ 


ence ence 
Experiment 


20-min. 
40-min. 
Control 


20-min. 
40-min 
Control 


area of S® is put into doubt; the 
greatest part of the interaction is 
presumably contributed by the flat- 
tened gradient of the 40-min. group. 

A measure which is particularly 
sensitive to changes in the form of the 
gradient is the gradient mean, as it 
reflects the response level at all test 
values. If the present gradients are 
multiplicatively related, their means 


will be the same; a specific decrement 


in the region of the S®, however, 
will be reflected by a mean shifted 
toward the lower wave lengths. The 
gradient means are presented in Table 
1 together with the differences be- 
tween each experimental gradient 
mean and the relevant control gradi- 
ent mean. The means of the PEGs 
reflect little, if any, change from the 
control the largest 
mu. In 
there is even an increase of 1.7 mu. 
The effects of discrimination train- 
ing.—The postdiscrimination gradi- 
ents (PDGs) are presented in Fig. 3 


gradients, de- 


crease being 1.6 one case 
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the mean number ol 
responses obtained at each stimulus 


in terms of 


value on the second generalization 
test. Compared to the control gra- 
dients, the PDGs show a suppression 
of responding in both experiments at 
values above 550 mu. At 550 and 
540 muy, the response level is higher 
after discrimination training. In Exp. 
II, the mode of the PDG is at 540 my 
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for both groups; this is particularly 
striking for the 40-min. group. In 
Exp. I, the mode remains at 550 my, 
but the value at 540 my is high enough 
to suggest a “‘truncated’’ peak be- 
tween 540 and 550 my, which may be 
estimated by a method of linear extra- 
polation suggested by Hanson (1956). 
This procedure reveals an estimated 
peak at 546 my for the 20-min. group 
and 545 my, for the 40-min. group. 

The effects of discrimination train- 
ing are striking enough to make a 
presentation in terms of percentage 
gradients unnecessary. It is quite 
clear that the gradients are radically 
altered in a manner that is non- 
multiplicative with the control gra- 
dient. The “relocation” of the gra- 
dient, similar to that obtained by 
Hanson, is reflected in the gradient 
means, which can be seen in Table 1. 
Every PDG mean is lower than the 
respective control mean, the smallest 
difference being 7.7 mu. 


The results suggest that any specific 


effect on the generalization gradient 
is contingent on the successive dis- 
crimination training procedure. The 
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possibility remains, however, that the 
continuous extinction procedure had 
some subtle effect, if not on the PEG, 
then possibly on subsequent discrimi- 
nation learning or on the PDG. In 
that case, we might expect systematic 
differences between the gradient means 
or some measure of discrimination 
learning as a function either of the 
time spent in continuous extinction, 
or the number of responses emitted 
under that condition. That the time 
in continuous extinction is not related 
to differences in the gradient means, 
either before or after discrimination 
training, is clear from Table 1. The 
differences are in some cases larger 
for the 40-min. group, and in some 
the 20-min. group. To 
examine the relationship between the 
number of responses in continuous 
extinction and these measures, each 
experimental group was divided in 
half, separating the high and low 
responding Ss in continuous extinc- 
tion. The mean number of responses 
obtained under that condition is pre- 
sented in Table 2 for each group, to- 
gether with the means of the individual 


cases for 


TABLE 2 


VARIOUS MEASURES FOR SS RESPONDING AT HIGH AND Low Rates DuRING 
CONTINUOUS EXTINCTION 


Group Response Rate 


in Extinction 


Mean Responses 


Mean PEG 
Mean 


Mean PDG 
Mean 


Jiscrimination 
ndex 


Experiment I* 


20-min. High 50. 
Low 364. 
High 55.5 
Low 23. 


( 


40-min. 


558.7 my 
559.8 547.2 
556.7 547.0 
555.1 553.7 


549.2 my 


Experimen 


20-min. High 
Low 
High 
Low 


40-min. 


546.0 
544.0 
542.2 
540.1 


uuu 
mn nm 
anc- 


un 


* Of the five Ss in the 40-min. group, the one giving the median rate was omitted from the analysis. 
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PEGs and PDGs as well as an index 
of discrimination learning, which is 
the ratio of the responses to S~ to 
those to S* in the last 10 min. of 
training. It might be expected that 
a greater number of responses in 
extinction would be correlated with 
a larger shift in the mean of the 
gradient and a lower discrimination 
index. The pairs of measures show- 
ing differences in the expected direc- 
tion are underlined ; they amount only 
to one-quarter of the total. 


DISCUSSION 


The foregoing analyses lead to the 
conclusion that the effects of continuous 
extinction on the generalization gradient 
bear little or no relation to the S®, and 
that they do not resemble the effects of 
successive discrimination training. The 
only reliable effect of the former that 
we have observed is a general reduction 
in response level of the postextinction 
With appears 
fruitless to attempt an explanation of 
the 
crimination 


tests. this in view, it 


“relocation”’ of responses after dis- 
training by means of the 
interaction of separate gradients of ex- 
citation and extinction. The changes 
generated by discrimination training ap- 
pear at this point to be sui generis, and 
they will have to remain so until a more 
fruitful approach to their explanation 
can be suggested. 

Such an approach may be represented 
by the study of 
trinsic to the discrimination situation. 
The amount of shift of the mean of the 


variables that are in- 


gradient would serve as an important 
dependent variable. Various differences 
conditions may 
independent variables; by 


in training serve as 
an examina- 
tion of each, we may be able to deduce 
those factors in discrimination training 
that are important for the occurrence 
of the peak shift. Honig (1958), for 
example, has shown that simultaneous 
little effect 
compared to 
successive, and produces no peak shift. 
Hanson (1956) that the 


discrimination training has 


on the gradient, when 


has reported 
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peak shift varies as a negative function 
of the CS* CS~ difference. One of the 
authors (Thomas) has evidence from an 
unpublished study that the amount of 
training may be the vehicle through 
which this variable of stimulus difference 
operates. In his study, two groups of 
Ss showed identical peak shift after the 
same amount of training on two problems 
one of which involved a CS*, CS> dif- 
ference more than twice the other. In 
the same study, that 
the peak shift the 
achievement of discrimination cri- 
The experiment also 
demonstrates considerable peak shift 
after an incomplete discrimination, as 
does an unpublished study by Guttman 
in which a discrimination was made be 


Thomas showed 


occurs well before 
the 


terion. present 


tween stimuli associated with two differ 
ent reinforcement schedules, one of which 
was more favorable than the other. 
Many basic questions about uperant 
discrimination and the postdiscrimina 
tion generalization gradient remain un 
answered. What is the effect of the 
proportion of negative to positive periods 
in the 
tion? 


discrimination situa 
What is the effect of the length 
of presentation periods? Of what influ 


ence is the reinforcement schedule asso 


suc cessi ve 


ciated with the positive stimulus; with 


the negative? These examples may 


suffice to indicate the kinds of questions 


that must be answered before the effects 
of discrimination training on the general 
ization gradient may be understood. 


SUMMARY 


The study was designed to investigate the 
effect of continuous extinction at a stimulus 
lying on a generalization gradient generated 
by reinforcement at another stimulus value 
The effects of this procedure were compared 
to those of presenting successive discrimina- 
tion training with the CS as the positive 
stimulus and the stimulus at which continu 
ous extinction was given as the negative. It 
was found that continuous extinction has only 
the effect of reducing the response strength 
at all points on the generalization gradient 
in multiplicative while 
discrimination training led to 


fashion, SUCCESSIVE 
1 suppression 


of responses, specifically in the area of the 
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1egative stimulus, and a shift both in the 
mean and the mode of the gradient away 
from that value. The conclusion was drawn 
that excitation and inhibition cannot inde- 
pendently account for the effects of successive 
discrimination training; the mixed presenta- 
tion of positive and negative stimuli appears 
to be essential. Certain variables inherent 
to discrimination training are suggested, 
which, when investigated, may clarify the 
basis of the effects of the formation of a 
successive discrimination on the generaliza- 
tion gradient 
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TIME ESTIMATION AS A FUNCTION OF LEVEL OF 


BEHAVIOR OF 


RUTHANNE K. S 


DEWOLFE anp CARL 


SUCCESSIVE TASKS 


P. DUNCAN 


Northwestern University 


If Ss are asked to estimate the 
duration of a fixed interval of time 
during which they are occupied with 
task (including “rest’’), their 
estimates vary inversely with certain 
characteristics of the task (Axel, 1924; 
Gilliland, Hofeld, & Eckstrand, 1946; 
Gulliksen, 1927 ; Harton, 1938; Weber, 
1933: Woodrow, 1951). Axel (1924) 
used the phrase “level of behavior”’ 
to refer to all such characteristics of 
tasks (interest, difficulty, cognitive 
demand, etc.) that 
tion of passage of time. 
taken together with a similar study 
by Gulliksen (1927), that 
such tasks as rest or tapping a pencil, 
at low levels 
behavior, produced overestimation of 
a fixed interval. An analogies test 
or long division problems, tasks con- 
sidered to be at high levels of behavior, 
yielded underestimation of time. An 
intermediate level was 
assumed to be represented by number 
or mirror reading ; these 
tasks yielded intermediate estimates. 

In a recent study by Kleiser (1953), 
S first worked for a 
fixed (standard) interval, then rested 
(comparison interval) until he judged 
as much time had elapsed as had 
elapsed during the standard interval. 
Kleiser’s with this method 
were essentially the same as_ those 
found by the the 
greater the ‘‘assumed cognitive de- 
mand” of the task used during the 
standard interval, shorter S’s 


a 


some 


influence estima- 
Axel’s study, 
showed 
considered be 


to ol 


of behavior 


canceling 


on some task 


results 


earlier workers; 


the 


estimate, i.e., the shorter the compari- 
son interval. 


The present study is an extension 
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Tasks assumed 
levels 


of Kleiser’s method. 
represent 
havior 


to various of be- 
used during both the 
standard and the comparison inter- 
vals. It assumed that rest, 


versed alphabet printing, and anagram 


are 


is re- 
solving represent low, intermediate, 
and high levels of behavior, respec- 
tively. These three tasks 
in all nine possible combinations as 
first, standard, 
comparison, tasks. 


are used 


or and second, 


or 

If the assumption about the rank 
order of these tasks along the dimen- 
level 
results 


sion of of behavior is correct, 
the predictable. 
Suppose, in one extreme case, that 
S the fixed, standard 
interval, then attempts to reproduce 
that interval’s duration by 
anagrams during the comparison in- 
terval. Since S during the 
standard interval, the interval seems 
long to him. He 


reproduce this “‘long”’ 


should be 


rests during 
solving 
rests 
now attempts 
interval while 
solving anagrams, a task in which time 
seems to go quickly, i.e., 
underestimated. 
mate, in time elaspsed during the 
comparison interval, should be longer 
than with any other combination of 
the tasks. the 
other extreme case, anagrams followed 
by rest, should produce the shortest 
estimates of all. 
these extreme cases, the prediction 
that the higher the level of be- 
havior the the 
shorter should be the estimates, but 
the higher the level of behavior of the 
comparison task, the longer should 
be the estimates. 


to 


is ordinarily 
Therefore, his esti- 


present Conversely, 


Generalizing from 


1S 


of standard task, 
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One presentation of a_ standard 
task followed by a comparison task 
is called one trial. Several trials are 
order to check a finding of 
Kleiser’s that 


given i 


time estimates tended 


to vary with progressive improvement 


in performance on the tasks them- 
selves. No attempt is made to pre- 
dict the results except on the first 


trial. 

METHOD 
Materials.—Reversed alphabet printing 
was done in blue books; a separate page was 
used for each trial. For the anagram task, 
50 five-letter anagrams were selected from 
the lists of Rees and Israel (1935). Thirty 
randomized lists of the 50 anagrams were 
prepared by table of random 
numbers Ss receiving the anagram 
task during the standard intervals were 
assigned the lists arbitrarily numbered from 
1 to 15 (all Ss received 15 trials). The Ss 
receiving anagrams during the comparison 
intervals worked on Lists 16 through 30 
Each list was typed lower case in two columns 
of 25 anagrams each on a separate sheet of 
paper, with the anagrams 
secutively from 1 to 50. 

A cardboard screen on the table at which 
E and S sat prevented S from seeing any 
of the materials or the .01 sec. Standard 
Electric clock used to time both standard 
and comparison intervals. 

Design.—The three tasks, rest, reversed 
alphabet printing, and anagram solving were 
used in all both the 
standard interval and the comparison inter- 
val. The Ss, 135 students in introductory 
psychology were assigned in turn 
to the nine groups, 15 Ss per group. The 
duration of the standard interval was 26 sec. 
on each trial. were 15 trials, each 
consisting of a standard and a comparison 
interval. 


means of a 
Those 


numbered con- 


nine combinations in 
courses, 


There 


The duration of the comparison 
interval, to the nearest whole second, was S's 
score. The S did not know the duration of 
the standard interval, and no knowledge of 
results was given 

Procedure.—Each S was first given a pre- 
trial with each of the tasks on which he would 
eventually work. For rest, the pretrial con- 
sisted of having S lean back in his chair, 
close his eyes, and relax as completely as 
possible for 5 sec. For anagrams, S solved 
in the pretrial as many as he could of a list 
of 10 practice anagrams. The pretrial for 
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reversed alphabet printing consisted of print- 
ing the alphabet once, printing the letters up- 
side down and from right to left. 

After the pretrial, S 
follow 5. 


instructed as 
He would be given a number of 
trials, each of which consisted of two parts. 
For the first part of each trial E would say 
“Go” and S would engage in the activity 
assigned for the standard interval until E 
said “Stop.” For the second part of each 
trial, E would again say “Go’’ and S would 
engage in the activity assigned for the com- 
parison interval. Then E said, “However, 
during the second part of the trial you will 
continue to rest (solve anagrams, or print the 
alphabet in reverse, as appropriate) only 
until you feel that you have rested (worked) 
as long as you rested (worked) during the 
first part of the trial when I said ‘Stop.’ 
Then you must say ‘Stop.’ When you feel 
that as much time has elapsed during the 
second part of the trial as elapsed during the 
first part of the trial when I said ‘Stop,’ 
you must say ‘Stop.’ You will be trying to 
match the two times exactly."” The S was 
told not to use any kind of counting method 
on which to base his reproductions. 


was 


After these instructions, S was given in- 
structions specific to the task or tasks he 
would be working on. The Ss who were to 
do reversed alphabet printing were told to 
work as quickly and as accurately as possible 
If in a single period (half-trial) they com- 
pleted the alphabet, they were to continue 
right on with A again, but at the beginning 
of each new period they were to start with 
the letter following the last letter printed in 
the preceding period. The E presented S 
with a fresh page in the blue book for each 
half-trial and told him the letter with which 
to begin. The Ss were instructed not to 
copy previous work, but to print each letter 
independently. 

The Ss who were to solve anagrams were 
told that during each period they were to 
solve as many anagrams as possible without 
using pencil and paper. As soon as S had 
solved an anagram, he was to call out its 
number and correct solution, and go on to 
any other anagram on the sheet. 

Before each standard or comparison inter- 
val, E said to S, “‘This time you will be resting 
(or solving anagrams, or printing the alphabet 
in reverse). Remember, this time it is your 
(my) turn to say ‘Stop.’ You must (I will) 
say ‘Stop’ this time.” Giving this instruction 
and handing S whatever materials he needed 
took no more than 5 sec. between standard 
and comparison intervals. Time 
trials was also about 5 sec. 


between 
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Fig. 1. Mean log time estimates on Trial 1. 


RESULTS 


Trial 1.—Since an attempt was 
made to predict the results on Trial 1, 
the data for this trial will be analyzed 
separately. Inspection of the Trial 1 
data revealed some positive skew in 
in each of the nine distributions. Be- 
cause of this, each raw score was con- 
verted toa logarithm. Hartley’s test 
for heterogeneity of variance among 
Fraz = 2.33, 


the log scores yielded 
which, with 9 and 14 df, is not sig- 
nificant. 

The mean log time estimates are 


plotted in Fig. 1. Because the precise 
location of the tasks along the dimen- 
sion “‘level of behavior”’ is not known, 
the tasks have been arbitrarily spaced 
equidistantly from each other on the 
baseline. By inspection, it appears 
that both variables had an effect in 
the predicted direction; estimates 
decrease with level of 
behavior of standard task and in- 
crease with increasing level of 
havior of comparison task. 

The analysis of variance on the 
means supported the trends suggested 
in Fig. 1. The F for standard task 
was 33.0; the F for comparison task 
was 69.6. With 2 and 126 df, both 
these 
.001 level. 
significant. 


increasing 


be- 


The interaction was not 


values are significant at the 


Although no attempt was made to 
measure the accuracy with which Ss, 
working on various tasks, can repro- 
duce a 26-sec. interval when they 
know the duration of the interval, 
it is of interest to note, in Fig. 1, 
that the absolute duration was re- 
produced most accurately by those 
three groups whose comparison task 
was the same as their standard task. 
The log of 26 is 1.415; the means for 
the Rest-Rest, Alphabet-Alphabet, 
and Anagram-Anagram groups were 
1.40, 1.41, and 1.45, respectively. 

Trend effects.—To examine the data 
for changes in time estimates over the 
15 trials, the 15 raw scores were 
combined into five blocks of three 
trials each. A graph (not presented 
here) of these five trend points for 
each of the nine groups suggested that 
there was very little change in time 
estimates over the 15 trials. A trend 
analysis on these data, to test for 
the effect of trials, showed that neither 
Trials, nor any of the interactions 
with Trials, was significant; time 
estimates did not vary with repeated 
experience with the two-task pro- 
cedure (even though scores on both 
the alphabet and anagram tasks 
improved over trials). The scores 
were fairly stable in 
respect ; the correlation between mean 
score on Trials 1-3 and mean 
on Trials 13-15, based on all 135 Ss, 
was .803. 

The trend analysis also yielded 
terms for the effects of the main 
variables on total raw score over all 
15 trials; these terms are shown in 
Table 1. It that in 
contrast to the score on Trial 1 alone, 
the interaction between standard and 
Be- 
cause of this, the interaction was used 
the 
the main 


also another 


score 


can be seen 


comparison tasks was significant. 


as error to compute Fs, shown 


in Table 1, for 
Even though the df are only 2 and 4, 


variables. 
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TABLE 1 


EFFECT OF MAIN VARIABLES ON TOTAL 
Raw Score Over ALL TRIALS 


Source é US I 


Standard task (S) 
Comparison task 
(C) 3,557.59 | 9.60* 
SEs. 3,288.71 | 8.04*** 
Between Ss in 
same group 


101,490.60 | 4.36 


2,896.73 


*P < 05 
P< 001 


the F for the comparison task was 
significant at the 5% level, and the 
F for the standard task reach the 10% 
level. (With Between-.Ss-in-same- 
group as error, the Fs for standard 
and comparison tasks were 35.04 
and 77.18, respectively.) 

Since the effects of the main vari- 
ables on total raw score over all trials 
were not the same (because of inter- 
action) as on log score on Trial 1, the 
mean of each S’s score over all 15 
trials was converted to a log to 
provide greater comparability with 
Trial 1 data. The Fyre: on the vari- 
ances of these log scores was 19.35, 
significant well beyond the 1% level, 
so the results must be interpreted 
conservatively. The analysis of these 
log scores showed that the interaction 
between the main variables was sig- 
nificant (F = 3.31), although only 
at the 5% level in this case. With 
the interaction as error, both main 
variables were significant; F = 8.78 
(P < .05) for standard task, F = 21.56 
(P < .01) for comparison task. (With 
Within-groups as error, F = 29.04 for 
standard task, F = 71.32 for com- 
parison task.) Despite heterogeneity 
of variance, on the whole it appears 
that on mean score over all 15 trials 
there was a significant interaction 
between standard and comparison 
task variables, but that over and 
above the interaction, the variables 
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still exerted significant effects. These 
effects were in the same direction as 
those shown in Fig. 1, as may be seen 
by comparing the points in the figure 
with the mean log over all 
trials, shown for each group in Table 
? 


score 


The significant interaction among 
standard and comparison task means 
in Table 2, i.e., the interaction re- 
vealed by analysis of log mean score 
over all trials, is the result of the same 
patterning among the means that 
occurred on Trial 1 (where the inter- 
action though not 
greater than zero). Both in Fig. 1 
and in Table 2, when rest was the 
standard task, the largest difference 
occurred between alphabet and ana- 
grams as comparison tasks, but when 
anagrams were the standard task, the 
largest difference occurred between 
rest and alphabet as comparison 
tasks. In addition, there is a re- 
versal (not significant) in Table 2 
between the alphabet and anagram 
means for standard task when rest 
was the comparison task. 

Table 2 shows that the means of 
the three groups whose standard and 
comparison tasks were the same, in- 
crease as one proceeds from Rest- 
Rest to Anagram-Anagram  condi- 
tions, as was also the case on Trial 1. 


significant, was 


This increase suggests that the com- 
parison task had a more powerful 
influence on estimates than did the 
standard task (note also that F values 


TABLE 2 


MEAN OF LOG MEAN TIME EsTIMATI 
ALL TRIALS FOR Eacu Group 


OVER 


Comparison Task 
Standard Task 
»yhabet Anagrams 
Rest 
Alphabet 


Anagrams 
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were always larger for comparison 
task); as might be expected, the task 
S is working on at the time he must 
make his estimate is probably the 
more important. Perhaps it is also 
worth mentioning that these three 
same-task groups contributed to the 
heterogeneity of variance found in 
log scores over all trials; as compared 
to the other six groups, the variances 
of the three same-task groups were 
very small (and nearly identical). 

At the end of the experiment Ss 
were asked a number of questions; 
their answers yielded the following 
information. The Ss estimated the 
length of the standard interval from 
as low as 2 to as high as 120 sec. 
Different Ss thought the standard 
interval remained constant, progres- 
sively lengthened, progressively short- 
ened, varied randomly. The Ss were 
also asked what they thought about 
the task or tasks they had worked on. 
It was clear from Ss’ remarks that 
the great majority of them rather 
strongly felt that the rest task was 
unpleasant, boring, progressively more 
boring, and intellectually dull, while 
the task felt to be 
pleasant, interesting, and challenging. 
The alphabet task did not tend to 


anagram was 


elicit strong feelings one way or the 
other. 


Disc USSION 


The first purpose of the study was to 
test the prediction that times estimates, 
defined here as the time S worked on the 
comparison task, would 
with the “‘level of 
standard task, but with 
the level of behavior of the comparison 
task. This 
not only on 


inversely 
’ of the 
directly 


vary 
behavior’ 
Vary 
prediction was confirmed, 
Trial 1, but mean 
score over all trials, even though there 
was a significant interaction in this latter 
(all-trials) score. 


also on 


It is also worth noting 
that the difference between our extreme 
conditions (Rest-Anagramsvs. Anagrams 
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Rest) is in the same direction as that 
found by Hirsh, Bilger, and Deatherage 
(1956) when their Ss were subjected to 
quiet or during standard and 
comparison intervals; quiet-to-noise pro- 


noise 


duced much longer time estimates than 
noise-to-quiet. Furthermore, both stud 
ies found that this 
tion’ effect was 

extreme 


“‘order-of-presenta- 
reduced when 
along the dimension 
(level of behavior in the present study, 


Hirsh 


less 


points 


decibel level of the noise in the 
study) were compared. 

The second purpose of the study was 
to check a finding of Kleiser (1953), who 
reported that time 
progressively longer 
trials. Although 
confirmed, it should be 
Kleiser varied only 
his comparison task always 
Because of this, only two of the nine 
groups of the present study, Alphabet 
Rest and Anagram-Rest, are comparable 
to Kleiser’s groups 


estimates became 


over a series of 
this finding was not 
that 
task; 


rest. 


noted 
the standard 
was 


Inspection of the 
groups 
alone at the five trend points (blocks of 


mean time scores of these two 
three trials) did suggest that time esti 
mates were becoming progressively longer 
over trials, although only slightly so 
Therefore, these two groups were com 
bined and the scores analyzed for trend 
The analysis yielded an F for trend of 
3.36, which is significant at the 5% level 
with 4 and 116 df. Thus, 
some evidence, although not 
that time tended to 
longer when tasks at 


were repeatedly 


there was 
strong, 
estimates become 
higher levels of 
followed by 


This finding does not seem to be 


behavior 
rest. 
a major exception to the over-all finding 
that time 
constant over 


estimates remained quite 


trials. Since appropriate 


control groups were not used, any pro 
gressive changes in time estimates over 
trials in either Kleiser’s or the present 
study can only be said to be concomitant 
with, not 


repeated experience with the procedure, 


due to, such variables as 
improvement in task performance, et 

suggested that the 
comparison task exerted a more powerful 
time than the 
task. Further support for 


It was earlier 


influence on estimates 


standard 
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this suggestion can be obtained by com- 
paring the slopes of the curves (not 
shown) that appear if one plots the grand 
means for each task variable separately. 
For example, on Trial 1 (where the 
nonsignificant interaction permits one to 
combine groups on the other variable), 
the grand mean of log scores decreased 
from 1.54 with rest to 1.33 with ana- 
grams, when these tasks were used dur- 
ing the standard interval. When the 
same tasks were used during the com- 
interval, the grand mean in- 
1.26 with rest to 1.59 for 
Thus, of level of 
behavior during the comparison interval 
seemed to have a somewhat greater effect 
than the same during the 
standard interval. 

Finally, it is suggested that the three 
tasks used here do represent points along 


parison 
creased from 


anagrams. variation 


variation 


the poorly-defined dimension, level of 
behavior, at least insofar as they influ- 
ence estimation of time. The tasks were 
originally selected because they seemed 
similar to the tasks used by Axel (1924) 
and Gulliksen (1927), ice., similar 
to rest or tapping a pencil, reversed 
alphabet printing 
canceling or 
solving 
division. 


rest 


number 
anagram 
analogies or long 
Gulliksen’s tasks 


similar to 
mirror reading, 
similar to 
Axel’s or 


were not used because it was thought 
that the use of new tasks would, if the 
results were in the predicted direction, 
provide greater generality to the dimen- 


level of behavior. Since the 
major results were in the expected direc- 
tion, the hypothesis that the present 
tasks do along the dimension 


receives some support. 


sion of 


Vary 


SUMMARY 


Each S was given several trials in each of 
which he first worked on a standard task for 
a fixed interval of time, then worked on a 
comparison task until he estimated he had 
worked the same length of time as had elapsed 


during the fixed interval. Three tasks 
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rest, reversed alphabet printing, and anagram 
solving—were used in all combinations as 
both standard and comparison tasks. These 
tasks were assumed to represent, respectively, 
low, intermediate, and high degrees of a 
dimension called “level of behavior.”’ 

The results were: (a) As expected, time 
estimates varied directly with level of be- 
havior of the comparison task, inversely 
with level of behavior of the standard task. 
These effects occurred and above a 
significant interaction, found with one meas- 
ure, between standard and comparison task 
variables. (6b) In general, time estimates 
did not vary over trials, although there was 
some evidence that estimates became progres- 
sively longer when standard tasks at higher 
levels of behavior were repeatedly followed 
by rest as the comparison task. 


over 
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This past decade has seen a wealth 
of important investigations of word 
meaning and word association. As 
yet, however, very little is known 
about the variables of which strength 
of semantic and 
tion between function. 
This study is concerned with one 


associative connec- 
words is a 


limited aspect of this problem, namely, 
the role of the logico-grammatical 
relation between words as a possible 
determinant or concomitant of judged 
semantic and associative connection. 
In a recent study, Flavell (1956) 
compared schizophrenics and normals 
as to judgments of semantic similarity 
among pairs of words bearing various 
logico-grammatical relationships to 
each other. Benedetti (1958) repli- 
cated this experiment with college 
student Ss, obtaining judgments of 
associative strength as well as mean- 
ing similarity. 
tion, however, 


In neither investiga- 
was it possible to 
assess in any rigorous way the extent 
to which strength of semantic or 
associative connection was a function 
of the relation between two words, 
above and beyond the specific content 
meanings of each word in isolation. 
The present 
tematic attempt to investigate the 
role of word-word relations, apart 


udy is a more sys- 


' Part of this study was supported by a 
research grant (M-2632A) from the National 
Institute of Mental Health, Public Health 
Service. The authors wish to thank Alfred 
Bauer and Douglas Yeaple for their assistance 
in tabulating and analyzing data. 


word content, in three kinds 
of judgments of verbal connection: 
similarity of meaning, relatedness of 


meaning, and 


from 


associative strength. 
Similarity and associative judgments 
were selected for study because of the 
obvious centrality of semantic and 
verbal habit connections in the area 
of verbal behavior; the inclusion of 
‘relatedness of meaning”’ as a variable 
requires comment, however. In a 
recent paper, Bousfield, Whitmarsh, 
and Berkowitz (1958) have suggested 
that “word relatedness” is a 
fundamental and inclusive bond be- 
tween words than meaning similarity 
and should be a more potent deter- 
minant of mediational effects such as 
verbal transfer and clustering. Spe- 
cifically, they that common 
antonyms, known to show 


more 


argue 
strong 
transfer and clustering effects, ought 
to be considered as related but dis- 
similar in meaning. ‘Thus, related- 
ness instructions were included in this 
study to find out whether, in naive Ss, 
they do, in fact, lead to different 
judgments from those given under 
similarity instructions. 

Since the experimental procedure used was 


somewhat detailed, a 
orienting follows 


complex and brief 


summary Eleven cate 
gories of relationships were used, each com- 
prising a pair of words bearing the defined 
relationship to each other. Each pair could 
then be paired once with each other pair, 
thus making 55 pairs of pairs, called here 


a “set.” A second set of 55 pairs could be 
constructed in using new 


word pairs to illustrate the same 11 categories 


the same way by 


159 
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From the point of view of the logico-gram- 
matical relationships involved, all sets con- 
structed in this way are equivalent; from the 
standpoint of the particular word pairs used 
to exemplify these relationships, all sets are 
different. In this study, five distinct sets 
were used, thereby constituting a basic “test”’ 
consisting of 275 pairs of pairs. The S’s 
task, depending upon which group he was in, 
was to choose from each pair of pairs the one 
whose members seemed more similar in mean- 
ing, more closely related in meaning, or more 
closely associated. Since, within each set, 
was paired once with each 
other category, such choices in effect produce 
a ranking of categories (1 to 11) for each of 
the three judgments. 


each category 


The rankings obtained in this way 
permit study of each type of judg- 
ment, and comparisons among judg- 
ments, with respect to the following: 
(a) extent of agreement among Ss 
in judgments within each set; (6) 
more important, extent to which 
rankings are similar across the five 
sets—if they are similar for a given 
judgment, it can reasonably be con- 
cluded that level of judged connection 


is in part a function of the relationship 
rather than the separate words them- 


the obtained rank hier- 
archies of relationships resulting from 
the various judgments. 


selves; (c) 


METHOD 


Subjects—The Ss were 222 sophomores 
enrolled in an introductory psychology course 
They were rewarded with examination credit 
Each of the three judgment groups consisted 
of 46 women and 28 men. 

Test materials.—The 11 categories of 
logico-grammatical relationships are defined 
below. The five pairs of words used in this 
study to exemplify each relationship follow 
the definitions; the first pair listed after a 
given definition belongs to Set 1, the second 
to Set 2, etc.: 

1. Synonym (SY). The members of each 
word pair have exactly or almost exactly the 
same referent (big 
automobile, couch—sofa, error—mistake). 

2. Similarity (Sl). The members of each 
word pair are similar in meaning in the fol- 
lowing way: the referents of both are closely 
aligned on some dimension, with the referent 


large, steal---rob, car 
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of the right-hand member occupying the more 
extreme position on this dimension (small 
tiny, smile—laugh, cool—cold, hungry 
ing, damp—wet) 

3. Supraordinate (SU). The left-hand 
member of each word pair denotes a common 
class of which the referent of the right-hand 
member is a familiar class member, easily 
recognized as such (bird 
fox, furniture—table, fruit—apple, tree—oak). 

4. Coordinate (COO). The members of 
each word pair refer to familiar, easily recog- 
nized class members of a familiar class (cow 
horse, pipe—cigar, chair—table, knife 
beets—peas). 

5. Attribute (AT). The right-hand mem- 
ber of each word pair refers to a quality or 
attribute generally recognized as always or 
nearly always characterizing the object de- 
noted by the left-hand member (lemon 
elephant—heavy, icc cold, turtle 
mouse—small). 

6. Contrast (CON The members of each 
word pair refer to opposite ends of a familiar 
continuum (hard—easy, weak, old 
young, tall—short, loud—-soft). 

7. Action-of (OF). The right-hand mem- 
ber of each word pair is an intransitive verb 
denoting a concrete action commonly associ- 
ated with and frequently performed by the 
agent referred to by the left-hand 
(dog—bark, rabbit—hop, 
cry, cat—purr). 

8. Action-upon(UP). The left-hand mem- 
ber of each word pair is a transitive verb 
denoting a concrete action commonly associ- 
ated with and frequently performed upon the 
object referred to by the right-hand member 
(sweep—floor, chew hands, eat 
bread, throw ball) 

9. Whole-part(WP). The right-hand mem- 
ber of each word pair refers to a familiar object 


Starv- 


Sparrow, animal 


Spoon, 


sour, 


cube slow, 


strong 


member 


lion roar, baby 


gum, wash 


easily recognized as an important part of the 
familiar whole denoted by the left-hand mem- 
ber (bird—wing, shoe—heel, hand 
tree—branch, bicycle—wheel). 

10. Part-part (PP). The members of each 
word pair refer to familiar objects easily 
recognized as important parts of a familiar 
whole ( pedal—handlebars, wall—floor, collar 
sleeve, headlight—brake, arm—head). 

11. Common use (CU). The right-hand 
member of each word pair denotes an object 
commonly with and frequently 
acted upon by the agent referred to by the 
left-hand member (dog—bone, farmer—tractor, 
baby—rattle, carpenter—nail, soldier—rifle). 

These particular 11 categories, culled from 
various sources, were selected for study be- 
cause they were thought to be fairly unam- 


finger, 


associated 
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biguous in definition, diverse in character, 
and frequently encountered in everyday 
language situations. They, of course, con- 
stitute only a small sample of the possible 
semantic relationships between words which 
could be abstracted from our language system. 
The word pairs used to represent each cate- 
gory were selected with the help of graduate 
student and staff judges in accordance with 
two guiding principles: (a) the words should 
be relatively common, familiar ones; (b) the 
five pairs for a given category should appear 
to display the defining characteristics of that 
category about equally well. Despite careful 
selection procedures, however, some difficul- 
ties remain; e.g., it is intrinsically more 
difficult to find homogenous pairs for some 
categories than for others, some word pairs 
are related in ways other than those specified 
by the category definitions. Problems of this 
kind are probably indigenous to any attempt 
to impose rigidly defined categories of seman- 
tic relationships upon a complex language 
system (Flavell, Draguns, Feinberg, & Budin, 
1958). 

Procedure —The test taken by each S 
consisted of a page of detailed instructions 
(different for each judgment group) followed 
by 286 pairs of word pairs.2. Of these 286, 
the first 11 were practice items; the remaining 
275 were the critical ones for which choices 
were scored. The 275 items appeared in the 
test form in a random order which was the 
same for all Ss 

The Ss performing under similarity-of- 
meaning instructions were instructed to select 
from each two-pair item the pair whose mem- 
bers seemed to them “closer in meaning.” 
For the word relatedness group, the instruc- 
tions were essentially the same except that 
phrases like “more closely related in mean- 
ing,” related to each other’ were 
substituted for “‘closer in meaning,” ““A comes 
closer to meaning B than C does to D,”’ etc. 
For the association group, the 
were somewhat more detailed. 


“more 


instructions 
A hypothet- 
association situation 
described in which an 


gives a sequence of 


ical continuous was 
imaginary individual 
a single 
The instructions then speci- 
this individual, would 


associations to 
stimulus word 
fied that, for one 

2 All test material and instructions de- 
scribed in this paper have been deposited 
with the American Documentation Institute. 
Order Document No. 5 956, remitting $1.75 
for 35-mm. microfilm or $2.50 for 6 X 8 in. 
photoprints. Make checks or money orders 
payable to: Chief, Photoduplication Service, 


Library of Congress, Washington 25, D. C. 
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assume that the first associate given is most 
closely associated with the stimulus word, the 
second associate is next most closely associ- 
ated, etc. The S was then directed to choose 
that pair of words from each pair of pairs 
which he thought would be the more closely 
associated for him, in the sense defined above. 
Out of practical necessity, the test was 
self-administered on a take-home basis. The 
instruction sheet, in addition to specifying 
the task, also strongly exhorted S to take the 
test in one sitting that same evening, and 
without consultation with others. 
Scoring.—Within each set, each S received 
a rank score (1 to 11) for each category based 
to 0) with which he 
chose this category in preference to the others 
with which it had been successively paired, 
All data analyses were based on these ranks. 
Further experimentation.—I\n order to sup- 
plement similarity judgments, a 
second, minor experiment was conducted. 
Seven men and 19 women, mostly junior and 


on the frequency (10 


data on 


senior psychology majors, were tested indi- 
vidually. These 26 Ss were given the 11 
category descriptions (somewhat less formally 
worded than the definitions presented above) 
plus five “examples” of each, consisting of the 
same word pairs used in the major study. In 
essence, the Ss were instructed to rank the 
categories in terms of the generalized simi- 
larity of meaning they felt to be inherent in 
Care was taken 
to make the instructions as similar as possible 
to the similarity instructions in the major 
study. The data from this experiment made 
possible comparisons between similarity rank- 


each category relationship 


ings derived from paired comparisons of 


concrete, individual word pairs, and rankings 
made directly on the basis of abstract defini- 
tions of relationships coupled with these same 
word pairs as concrete examples. 


RESULTS 


The analyses described below were 
first performed on the data for each 
sex separately; since no meaningful 
pattern of statistically significant sex 
differences emerged, the analyses 
which follow involve total judgment 
groups only. Table 1 shows coefficients 
of concordance (W) for each judgment 
group (S = similarity, R 
ness, A = association) expressing rank 


= related- 


agreement among Ss within each set 
separately and across the five mean 
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TABLE 1 
KENDALL COEFFICIENTS OF CONCORDANC! 
(W) ExpressSING AGREEMENT ON Cart- 
EGORY RANK ORDER WITHIN AND 
Across SETS BY JUDGMENT 


Within Sets 
Judg Across 
ment Sets 
. Set 4 
A9 oe .86 
42 of 84 
.26 oe B 


set rankings (Siegel, 1956). As the 
“within-sets’’ portion of the table 
shows, inter-S agreement is for each 
set highest for similarity judgments, 
next highest for 
ments, and association 
judgments. Even the lowest Ws in 
the table are highly reliable, however 
(P < .001). The across-set Ws un- 
der S and R instructions are sub- 
stantial and about equal; the cor- 
responding Ws for A judgments are 
definitely lower than the others. The 
group of 26 Ss who made similarity 
rankings among categories directly 
from definitions and examples (S- 
Abst) showed an inter-S W_ (.58) 
comparable to the within-set correla- 
tions of the S group. 

Table 2 shows the means of the 
individual set mean rank orders for 
each of the four judgment groups. 
Rank order correlations among the 
four rows of this table are as follows: 
S-Abst—S = .95; S-Abst—R = .93; 


relatedness judg- 
lowest for 
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S—R = .99; S-Abst—A = .40; S—A 
= 49; R—A = .57. It is clear that 
S-Abst, S, and R instructions produce 
highly similar category rankings, while 
A instructions produce a_ hierarchy 
decidedly different from any of these. 
Inspection of Table 2 suggests that 
relationships SY, SI, and SU show a 
relative drop in rank position under 
A instructions while OF, UP, and 
CON rise in rank. 

In summary, similarity and related- 
ness instructions seem to have almost 
identical effects upon the judgment 
parameters studied here, with the 
exception of an apparent difference 
in inter-S agreement within sets. In 
both cases there is good evidence that 
a given relationship tends to maintain 
its rank status in the hierarchy despite 
variation in the words used to express 
the relationship. Furthermore, the 
two instructions produce highly simi- 
lar relationship hierarchies. In the 
case of association instructions, on 
the other hand, inter-S agreement is 
lowest of all, rank status of relation- 
ships is less predictable from set to 
set, and the mean rank hierarchy of 
relationship differs substantially from 
that produced by the other two 
judgments. 


DISCUSSION 


Although the data presented in Tables 
2 and 3 suggest that S and R instructions 
were, in all essentials, functionally equiv- 
alent in this study, it is not clear why 


TABLE 2 


MEAN Across-SET RANK ORDER AMONG CATEGORIES BY JUDGMENT 


Judgment 


Wm Nm hy 
nee 


ategory 
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this was so. It might have been ex- 
pected, following the reasoning of Bous- 
field, Whitmarsh, and Berkowitz (1958), 
that Ss would respond to instructions 
stressing meaning relatedness by pro- 
moting contrasts in the relationship 
hierarchy. It may, of course, be that 
changes in instruction wording, increas- 
ing the range and diversity of word rela- 
tionships, etc., might well produce judg- 
ment differences not apparent in this 
study. On the other hand, it may 
simply be that the presence of the term 
‘“‘meaning,”’ coupled with a general set 
towards comparing distances between 
words, tends to produce in naive Ss a 
global strategy unaffected by connota- 
tive differences among expressions like 
“close or similar in meaning,” 
related in meaning,” etc. 
Granted that the two judgments did 
produce similar strategies in this study, 
strategies which showed a certain con- 
stancy across sets and Ss, it is interesting 
to speculate as to what these might have 
been. One possibility is that most Ss 
tended to adopt the strategy of judging 
words showing abstract, ‘ 


‘closely 


‘similarity” 
relationships to be more related or similar 
than those expressing concrete, ‘‘con- 
tiguity”’ relationships (Flavell, 1956; 
Jakobson, 1955). Although the obtained 
hierarchy lends some support to this 
interpretation, there are some difficul- 
ties, e.g., WP and AT should be lower 
than SU in the hierarchy and this is not 
the case. Furthermore, such a strategy 
would not account for systematic rank 
differences among relation- 
ships or among similarity relationships. 
As one interesting example, subjects 
and their characteristic actions (OF) 
were almost always judged to be more 
similar and related than actions and their 
characteristic objects (UP). 


contiguity 


As to the association judgment results, 
it seems reasonable to suppose that the 
lower level of within- and across-set agree- 
ment resulted from the fact that the 
instructions were more oriented towards 
idiosyncratic, immediate reactions in the 
individual S than was the case for R and 
S judgments. 
parisons were not possible in his study, 


Although across-set com- 
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Benedetti (1958) also showed that asso- 
ciation judgments yield inter-S 
agreement than meaning judgments. 
However, the relatively low correlations 
between A judgments and each of the 
other three (.40, .49, .57) are at variance 
with the results of an earlier study 
(Haagen, 1949). Haagen, using S and 
A instructions similar to used , 
here, found a correlation of .90 across 
adjective-adjective word pairs ranging 


less 


those 


from near-perfect to rough synonymity. 
This suggests that the relationship be- 
tween semantic and associational judg- 
ments is a complex one which may, 
among other things, depend upon the 
form class of the words judged and the 
range of sampled 
(Haagen’s range is considerably narrower 
than that studied here). Finally, it has 
been shown that the hierarchy of rela- 
actually obtained under A 
instructions contrasts with the semantic 
hierarchies in several respects. That 
categories such as OF, UP, and CON 
should have higher rank status under A 
instructions appears reasonable, in view 
of the allegedly verbal- 
habit character of associational processes. 
Again, Benedetti (1958) showed 
that association instructions favor verb- 
noun relations. What is surprising, how- 
is that coordinates and contrasts 
were not selected even more frequently 
than they were, in view of their very 
high frequency as responses in ordinary 
word association (Woodworth & Schlos- 
berg, 1954). 
gests considerable caution in interpreting 


semantic distance 


tionships 


transitional, 


also 


ever, 


Evidence of this sort sug- 


judgments of associational strength as 
directly equivalent to operant 
association as a means of 
verbal habit connections. 
According to one phrasing of the pres- 
ent findings, it has been shown that the 
semantic distance between two words is 


word 
assessing 


roughly predictable from prior knowledge 
of the logico-grammatical 
uniting them. 


relationship 
may 
be that these relationships do not really 


However, it well 
determine level of judged distance di 
rectly, but only through the operation 
of real which they 
are correlated or confounded. For ex- 


determinants with 
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ample, all pairs like bird—wing, shoe— 
heel, tree—branch, may be judged to be 
of approximately similarity or 
actual deter- 
minant or determinants of such judg- 


equal 
relatedness because the 
ments happen to operate at a more or 
less constant level of strength for all WP 
relationships. What might these de- 
terminants be? One possibility is that 
S responds in terms of some complex 
weighting of attributive or connotative 
similarity. Osgood, Suci, and Tannen- 
baum (1957, pp. 143-146) report a study 
by Rowan in which similarity judgments 
were successfully predicted from seman- 
tic differential D scores. Another possi- 
bility is that S makes his judgment in 
terms of a similarly complex weighting 
of implicit word association responses 
which the two judged words elicit in 
common. Cofer, in a recent experiment 
(1957), has lent plausibility to 
this interpretation. Still another possi- 
bility, as yet hardly explored, is that an 
S judges two words to be similar in mean- 
ing to the extent that their referents tend 
to co-occur, “in nature,’’ in the same 
context. In a recent 
pilot study, the senior author presented 
in a fixed random order the 22 
pairs of Sets 1 and 2 to 15 graduate 
students under instructions to rate co 


some 


spatio-temporal 


word 


occurrence of referents on an 11-point 
Each S first assumed an occur 
rence of the referent of A in an A-B pair 
and then rated the probability that B’s 
referent would occur in the same context 
at the same time. 


scale. 


After each word pair 
had been rated in this way, S made similar 
ratings of referent A’s probability of co- 
occurrence, given referent B. The ‘‘co- 
occurrence score” for a pair was simply 
the sum of these two ratings. Inter-S 
agreement in this judgment task was 
comparable to inter-S agreement under S 
instructions: for Set 1, W = .59; for 
Set 2, W = .46. Rank order correla- 


tions between co-occurrence and S judg- 
ments were .78 for Set 1 (P < .01) and 
.56 for Set 2 (P < .05). 
tion holds up over a more extended range 
of semantic 


If this correla- 
distances, co-occurrence 
judgments may prove a fruitful third 
approach. Our that 


present guess is 
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co-occurrence estimates, to the 
that they 
operate 


extent 
operate at all, 
multiplicatively 


probably 
with one or 
more other determinants in producing 
distance judgments. 


SUMMARY 


The primary purpose of this investigation 
was to find out whether the logico-gram- 
matical relationship obtaining between two 
words is a partial determinant of judged 
similarity in meaning, judged relatedness in 
meaning, and judged strength of associative 
connection. Five sets of 11 word pairs each 
were constructed, each pair in a given set 
showing one of 11 such relationships. Each 
of the 11 relationships was thus embodied in 
five different word pairs, one in each set. 
The Ss made paired comparison judgments 
of similarity, relatedness, or associative con- 
nection on the 55 possible two-pair combina- 
tions of word pairs within each set, thus 275 
judgments across all five sets. Intragroup 
agreement among Ss within each set was 
highest in the case of similarity judgments, 
next highest for relatedness judgments, and 
lowest for association judgments. For both 
similarity and relatedness instructions there 
was a decided tendency for a yiven relation- 
ship to maintain its rank status across sets 
despite set to set variation in word content; 
rank stability under association 
instructions. Similarity and relatedness in- 
structions produced highly similar rankings 
of relationships; association judgments yielded 
a relationship hierarchy quite different from 
both. It was concluded that, for similarity 
and relatedness especially, judgments bearing 
on two words may be at least roughly pre- 
dictable from prior knowledge of the logico- 
grammatical relationship involved. 


was lower 
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PERTURBATION ON TRACKING PERFORMANCE ! 
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In order to exercise vehicular con- 
trol, S must be supplied with informa- 
tion regarding the course to be 
followed—the input signal (I) and 
the result of own actions—the 
feedback signal (R). The pursuit 
type display provides | and R infor- 
mation separately by moving elements 
called the target the cursor, 
respectively. The compensatory type 
display provides one fixed and one 
moving element, the difference be- 
tween these two indicators represent- 
ing error or the algebraic difference 
between I and R signals. 

In a recent experiment, Poulton 
(1957) noted that whereas any deg- 
radation of information on a pursuit 
display resulted in a deterioration of 
tracking performance, the extent of 
performance decrement was a function 
of the locus of degradation. Thus, 
tracking was least proficient when the 
I information was subject to inter- 
mittent presentation, while perform- 
ance was relatively less affected when 
the R information was degraded by 
intermittence. When the I and the 


his 


and 


R signals were simultaneously de- 
graded, tracking performance approxi- 
mated that found when the I informa- 


1 This research was conducted in the Labo- 
ratory of Aviation Psychology of Ohio State 
University and was supported in part by the 
United States Air Force under Contract No. 
AF 41(657)-70 with the Ohio State Univer- 
sity Research Foundation, monitored by 
M. R. Rockway, who served as Scientific 
Officer for the Aero Medical Laboratory, 
Wright Air Development Center. Permis- 
sion is granted for reproduction, publication, 
use, and disposal in whole or in part by or for 
the United States Government 
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tion appeared intermittently on the 
display. 

If findings are truly 
indicative of the relative importance 
to tracking accuracy of the input and 
feedback information on a visual dis- 
play, then a similar trend should result 
when other forms of display degrada- 
tion are present. One source of 
degradation is visual noise which ap- 
pears as random movement of the 
display element(s). This movement 
is uncorrelated with and independent 
of the input signal itself. It has 
been demonstrated that visual noise 
has a detrimental effect on tracking 
with a compensatory display (Briggs 
& Fitts, 1956; Briggs, Fitts, & 
Bahrick, 1957). When confronted 
with such a signal-in-noise condition, 
S is required to estimate and track 
the more coherent signal and dis- 
regard or “filter out’ the random 
(noise) component of the displayed 
information. 

The present investigation, then, was 
undertaken to test the generality of 
Poulton’s findings for degradation 
other than intermittence, and for a 
compensatory as well as for a pursuit 
display. Visual noise was added to 
the elements of a pursuit tracking 
display in the following manner: (a) 
noise on the target or input element 
(1) only, (0) the cursor 
element (R) only, and (c) noise on 
both I and R_ simultaneously. A 
fourth signal-in-noise condition was 
obtained by adding visual noise to 
the error signal on a compensatory 
tracking display (E). Further, the 
variables of input complexity and 


Poulton’s 


noise on 





VISI 


amplitude of visual noise were in- 


cluded in the design to permit a more 
genera! interpretation of the results. 


METHOD 


Apparatus and scoring.—The S viewed a 
5-in. cathode ray tube (CRT) display of 
tracking information, and he utilized a one- 
dimensional! control device to generate his 
tracking response. A change in position of 
the control device produced an immediate 
and directly proportional change in the posi- 
tion of a cursor on the CRT. The control- 
display sensitivity such that a 10 
horizontal rotation of the control deflected 
the cursor 1 in. on the CRT. The input 
signals were displayed with a peak-to-peak 
amplitude of 4 in.; thus, S was required to 
rotate the control device through an arc of 
+20° from its center position. This latter 
position was set at an angle of approximately 
50° measured counterclockwise from the 
vertical plane of S’s thorax, and the axis of 
rotation was at S’s elbow. The control was 
not spring-centered and was subject to mini- 
mal friction. The tracking task was 
dimensional and the CRT display contained 
two partially overlapping vertical lines, the 
target and the cursor elements, which moved 
only along the left-right axis. Twe display 
modes were employed: a display 
and a compensatory tracking 
information. 

Performance was measured by integrating 
the absolute values of tracking error during 
each trial. The error signal was scored prior 
to that point in the system at which the noise 
signals were introduced. 
appeared in the scoring circuits as a voltage, 


was 


one 


pursuit 
display of 


Since tracking error 


it was desirable to transform these raw error 
scores from arbitrary units to a meaningful 
metric expressed in scale units matching those 
of the tracking display, i.e., inches. Thus, 
the raw integrated error scores were divided 
by time, T = 60 sec., to provide a metric 
analogous to the average deviation of S's 
°T 

7 I e\dt 
his metric is termed average error. These 
average error scores were then transformed 
from arbitrary voltage units to a linear scale 
of inches of CRT movement. To illustrate, 
a score of .5 in. is interpreted as the average 
deviation of S’s error amplitude distribution 
if that distribution were drawn to the same 
scale as that of the visually displayed error. 

Experimental conditions.—Three variables 
were manipulated systematically in this 


error amplitude distribution, 
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study: (a) input (two levels), 


(b) magnitude of visual noise (three levels), 


complexity 


and (c) locus of visual noise (four posjtions), 
a total of 24 combinations of experimental 
conditions. In addition, four noise-free con- 
ditions were included to provide a base line 
of tracking performance. Thus, S tracked 
both inputs with both the pursuit and com- 
pensatory display under noise-free 
display conditions. The two levels of input 


complexity included a single 10-cpm course 


modes 


and a more complex course consisting of a 
combination of 10 and 30 cpm sinusoids of 
equal magnitudes and zero phase angle. In 
either case, the peak-to-peak excursion of the 
input signal was +2 in. on the CRT The 
three levels of noise magnitude, N,;, Nz and 
N;, were described by random signals with 
normal amplitude distributions and average 
deviations of .03, .05, and .11 in., respectively. 

The noise signals were produced by an 
electronic random signal (Clark, 
Fontaine, & Warren, 1953). The signals were 
peaked at 30 cpm with a bandwidth of 15-60 
cpm. Attenuation at the bandwidth was 24 
db per octave. 


generator 


The above estimates of aver- 
records 
Of the four loci of visual 
noise, three appeared on the pursuit display 
so as to show perturbation ol (a 
element only (1), (0) 
(R), and (c) 
taneously. 


age deviation were based on 15-min 
of the noise signals 


the target 
the cursor element only 
both | and 
As the fourth locus of perturba- 
tion, visual noise was added to the tracking 
error signal (E) on the compensatory display. 
When visual noise appeared simultaneously on 
the I and R elements, the noise signal added 
to the I element was statistically equivalent 
to but independent of that added to S's 
output (R) 

Subjects and procedure 
undergraduate volunteers were paid for ser- 
vice in this experiment. Each S participated 
in one practice and four experimental sessions, 
and over the latter, each S tracked under all 
24 combinations of noise amplitude, noise 
locus, and course complexity rhe initial 
portion of the practice session consisted of 
16 30-sec. tracking trials under No conditions 
This enabled S to approach asymptotic levels 
of proficiency under noise-free conditions for 
both pursuit and compensatory display modes 
with both the simple and the complex inputs 
These trials were followed by 16 65-sec. trials 
also under N» conditions. There were four- 
trial blocks for each of the four combinations 
of display mode and input complexity. The 
performance measured during these 16 trials 
established the base line against which to 
compare proficiency under the N,, No, and 


R elements simul- 


I'wenty-four male 
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N; noise conditions (see No in Fig. 1 and 2). 
The scheduling of an additional set of 16 No 
trials in this first session was further assurance 
that S had attained near-asymptotic skill at 
the basic tracking task 

Each of the remaining four sessions in- 
volved a total of 24 65-sec. trials. 
visual 


Locus of 
defined these four sessions (I, 
R, I and R, and E), and within each session S 
tracked four consecutive trials on each of the 
six combinations of input complexity and 
noise amplitude. An intertrial rest interval 
of 25 sec. was used with a 2-min. rest between 
blocks of four trials. Order of the four noise 
loci was counterbalanced across the last four 
experimental via a latin square 
design, while order of conditions within a 
session was determined from a table of random 
numbers. The use of 65-sec. trials permitted 
a 5-sec. warm-up period prior to the scoring 
of tracking performance 

Instructions to S emphasized that he was 
to disregard the random variations in position 
of the display element(s) and track the more 
coherent signals. During the experimental 
sessions S was given explicit instructions 


noise 


sessions 
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concerning the particular condition to follow 
in each block of trials, and he was reminded 
frequently to “track through the noise.” 
Knowledge of results in the form of integrated 
absolute error was given verbally immediately 
following each trial, and during the intertrial 
rest intervals S was shown (but not permitted 
to track) the noise-free display and the noise- 
free input to be tracked on the subsequent 
trials. 


RESULTS 


The major results of this study are 
summarized in Fig. 1 and 2, where 
tracking proficiency is shown as a 
function of the magnitude of visual 
noise with locus of perturbation as 
the parameter. The data summarized 
in Fig. 1 were taken with the single 
10-cpm input, while Fig. 2 represents 
tracking the more complex 10 + 30 
cpm course. Each point in these 
figures is the sum of four 60-sec. 
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and locus of perturbation. 


trials for each S, averaged over the 
24 Ss. The average deviation metric 
discussed above 
performance here. 

From visual inspection of Fig. 1, 
it is apparent that, as expected 
(Briggs & Fitts, 1956; Briggs et al., 
1957), an increase in the magnitude 
of visual noise resulted in decrements 
in tracking proficiency. Further, the 
extent of such decrement is dependent 
on the locus of perturbation: adding 
noise signals to the cursor element 
(R) produced the least decrement in 
tracking proficiency, while noise on 
the target element (I), on the I and 
R elements simultaneously, or on the 
(E) resulted in a far 
greater reduction in proficiency. The 
results with the complex 


serves to describe 


error signal 


obtained 


input are comparable in this respect 


(see Fig. 2). 


In order to illustrate this differen- 
tial effect of visual noise, a linear equa- 
tion was fitted to each of the eight 
functions shown in Fig. 1 and 2. 
Such a function provides a reasonable 
fit over the range of noise magnitudes 
studied here. However, data reported 
elsewhere (Briggs & Fitts, 1956) 
indicate that the initial portion of 
these functions is positively acceler- 
ated. Thus, the use of a linear equa- 
tion here is justified an 
expedient to demonstrate differences 


only as 


in slope of the several noise locus 
functions, and does not suggest the 
general relationship between tracking 
error and noise magnitude. Table 1 
presents the slope constants for these 
curves as determined by least squares 
fits. It is apparent from the table 
that for the simple input, the I and 


the I and R curves have nearly 
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TABLE 1 


SLOPE CONSTANTS FROM A LEAST SQUARES 
Fit oF LINEAR EQUATIONS TO THE 
FUNCTIONS OF FIG. 1 AND 2 


Input Complexity 
Noise Locus 


10 cpm 10 + 30 cpm 


Input (1) 1.8 
Response (R) 0.6 
L&R 2.1 
Error (E) 2.2 


identical slopes (1.8 and 2.1, respec- 
tively), while that for the E condition 
is only slightly greater (2.2). In 
marked contrast, however, the curve 
describing the R condition shows a 
slope constant of only .6. With the 
complex input, nearly identical slopes 
were found for the I, I and R, and E 
conditions: 1.4, 1.3, and 1.3 respec- 
tively. The R _ condition for the 
complex input has the slope 
(0.3). 

Finally, it is interesting to note that 
the slope constants for the simple 
input are uniformly higher than those 
for the complex input. This results, 
of course, in a convergence of the 
functions, for comparable noise locus 
conditions, at the higher magnitudes 
of visual noise. This result was not 
predicted, since, within the range of 
noise amplitudes sampled, the in- 
creased complexity of the 10 + 30 
cpm input was expected to make it 
more difficult for S to estimate this 
signal when obscured by noise as 
compared with the simple 10 cpm 
course. An a posteriori explanation 
for this finding is based on the fact 
that if S did not move his control 
device at all during a tracking trial, 
his error score would be lower for the 
more complex input. As calibrated, 
the score for the complex input would 
have been .7 in., while the average 
error metric with the simple input 


least 
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would have been 1.1 in. Thus, even 
if S were to filter out the noise equally 
well in the simple and the complex 
input conditions, one would expect 
a convergence of the functions such 
as that apparent in Fig. 1 and 2. 

Analyses of variance were applied 
separately to the data of the pursuit 
and compensatory tasks. Separate 
analyses were required here as a com- 
bined analysis would result in a con- 
founding of noise locus effects and 
task difficulty. Further, the analyses 
did not include the data from the 
initial noise-free condition, No. The 
results of these analyses are presented 
in Table 2. The analysis for the 
pursuit conditions (I, R, I and R) 
shows that each of the main effects 
of Noise Locus, Noise Magnitude, and 
Course Complexity, is significant. In 
addition, the Locus X Magnitude in- 
teraction is significant at P < .01, 
and the Course Complexity X Magni- 
tude interaction is 
P < .05. 

The No conditions were not in- 
cluded in the analyses of variance 
reported in Table 2 because, as 


significant at 


indicated above, No was experienced 


rABLE 2 


ANALYSES OF VARIANCE OF THE DaTA 
DEFINING FIG. 1 AND 2 


Source 


Compensa 


> 
Pursuit tory 


118.54** 
(4.03)* 


Noise magnitude (M) 
M X Ss 

Noise locus (L) 

L x Ss 

Input complexity (C) 
C X Ss 


317.72* 
(2.48)* 
18.25** 
(21.02)* 
9.82** 
(6.50)* 


61.66** 
(6.48)* 


Residual 


*P < 05 
*P < 01 
* Error term 
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prior to Ny, Ne, and Nz, and therefore 
No did not appear in the systematic 
combination of noise level, noise locus, 
and input complexity over Sessions 
2-5. Instead, separate ¢ tests were 
made between the No and N, condi- 
tions for each of the noise loci. The 
results of these tests for the simple 
input (Fig. 1) indicate that all differ- 
ences between No» and N, were signifi- 
cant (P < .01) with the exception of 
the R condition (P > .05). For the 
complex input (Fig. 2), however, no 
significant differences were demon- 
strated between Ny and N,. Addi- 
tional ¢ tests were then applied to 
the Ny, — N2 differences. In _ this 
instance, all differences were signifi- 
cant (P < .01) except that for the R 
condition. 

Subsequent to the above analyses, 
the Duncan Range Test (1955) was 
applied to the data defining the 
several functions in Fig. 1 and 2. 


Again, the No data were excluded. 
As a result of this application and 


from the above ¢ tests, it was estab- 
lished that the I, I and R, and E func- 
tions have significant slopes (P < .05), 
whereas none of the points on the R 
functions differ from the 
condition (P > .05). Thus, the two 
slope constants for the R condition, 
Table 1, do not 
significantly from zero. 


noise-free 


as listed in differ 


DISCUSSION 


The above results clearly support the 
hypothesis that the various display ele- 
ments contribute unequally to the ac- 
curacy of controlling responses of the 
human predicted from 
Poulton’s findings (1957), performance 
suffers markedly the target in- 
formation alone (I) or the target and 
cursor information (I and R) is obscured 
simultaneously by visual noise, whereas 
a much smaller performance decrement 
occurs when the same perturbation is 


operator. As 


when 
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added to the feedback information alone 
(R). In addition, the prediction that 
noise added to the error information 
(E) in a compensatory display would 
have an effect similar to that for Il and R 
in a pursuit display is upheld. Unlike 
the Poulton study, however, no reliable 
difference was found between the I and 
the I and R condition, and the decrement 
resulting from the addition of noise to 
the R element was not significant. Thus, 
it appears that degradation of the visual 
feedback information (R) by intermit- 
tence more detrimental relative 
effect on performance than degradation 
by noise. 

The implication of these findings is 
that visually coded feedback of response 
information to operator 
performance than is input information. 
Since both the present study and that 
reported by Poulton lead to the same 
general conclusion, it is suggested that 
this differential effect is a general prin- 
ciple applying to a variety of conditions 
which result in a degradation of the 
display of information signals. 

This unequal effect of display degrada- 
tion may be explained, in part, by con- 
sidering the alternatives available to 
S under those when visual 
noise is added to the input signal, as 


has a 


is less critical 


conditions 


compared to the alternatives when it is 
added to the feedback element. In the 
former, S may either filter out the noise 
as instructed, i.e., he attempts to discrim- 
inate signal from noise, or he may track 
the noise signal itself to a certain extent, 
in which case tracking error would be 
directly proportional to the magnitude 
of the noise. However, when the feed 
back information is obscured, another 
alternative is available: S may partially 
disregard the visually coded feedback 
information and rely to a greater extent 
on proprioceptive cues. That it is 
possible for him to learn to utilize these 
cues is suggested by Poulton (1957). 
The use of proprioceptive cues as a 
source of response feedback information 
is particularly appropriate in this track- 
ing task: S enjoyed positional control 
over the system, and thus the amount 
of movement of the R element on the 
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pursuit display was directly proportional 
to the extent of control device rotation. 
As such, there was maximum agreement 
between the visually coded feedback 
information and that available 
the amplitude cues which arise from arm 
movement. Birmingham and Taylor 
identify this as a situation with a high 
level of “‘stimulus-response integrity” 
(1954). It has been demonstrated else- 
where (Bahrick, 1957) that practiced 
Ss can effectively utilize such proprio- 
ceptive cues in direct positioning tasks. 
It is therefore logical to assume that the 
present Ss were able to rely on these 
cues and thereby proficient 
levels of performance, even with large 
magnitudes of visual noise on the R 
element of the pursuit display. 

If this hypothesis is correct, the su- 
perior performance under the R condi- 
tion over that with the other locus condi- 
tions would be expected to decrease as 
the “integrity” visual and 
proprioceptive feedback information is 
reduced. Thus, if S were required to 
track through unquickened velocity or 
acceleration dynamics (Birmingham & 
Taylor, 1954), rather than with a posi- 
tional control as in the present study, 
then proprioceptive cues arising from 
control movements would not be directly 
proportional to displayed feedback infor- 
mation. Assuch, S might find it difficult 
to substitute proprioceptive for visual 
cues, and the effects of visual noise on 
the R condition would be expected to 


from 


maintain 


between 


approach that on the I and E conditions. 
Additional research is required to test 
this hypothesis. 


It has been observed elsewhere that, 
in the range of input frequencies utilized 
here, compensatory modes of informa- 
tion display result in proficient 
tracking than the pursuit mode (Cherni- 
koff & Taylor, 1957). The same result 
is apparent in the data. As 
indicated earlier, however, whereas the 
intercept values of the E and the I and R 
functions in Fig. 1 and 2 are significantly 
different, the slope constants are ap- 
proximately the same Table 1). 
Thus, the addition of visual noise to the 
E element produces performance decre- 


less 


present 


(see 
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ments of the same order as those obtained 
when either the I or the I and R elements 
are perturbed. This finding conforms to 
the present hypothesis in that S cannot 
substitute proprioceptive feedback cues 
for visually coded error information (the 
compensatory mode) any more than he 
can do so for visually coded input 
information (the pursuit mode.) 

It may be concluded that S performs 
very inefficiently when required to track 
a coherent path through noise. When 
unable to substitute proprioceptive cues, 
as in the I, E, and the I and R conditions, 
it is apparent that S actually tracks the 
noise signals to a considerable extent 
(Fig. 1 and 2). As a demonstration of 
this, consider the N; point on the I 
function in Fig. 1. The average error 
at that point is .15 in., and this actually 
exceeds an average error which would 
be predicted by assuming that S attempts 
and succeeds in tracking perfectly the N; 
signal (.10 + .03 = .13in.). An exami- 
nation of the other points in Fig. 1 and 
2 reveals that average tracking error in 
most cases approximates and even ex- 
ceeds the error level predicted, as in the 
above example, from the average devia- 
tion measures of the noise magnitudes. 
Thus, S was not able to disregard the 
random perturbation of the input dis- 
play elements despite near-asymptoti 
levels of skill on the basic tracking task. 


SUMMARY 


[wenty-four Ss each tracked four con- 
secutive 65-sec. trials under 28 combinations 
noise magnitude (four levels in- 
cluding a noise-free condition), locus of per- 
turbation (four loci), and input complexity 
(two levels). Visual noise 
random perturbation of elements of the 
tracking display, and included noise on (a) 
the target element alone (1), (6) the cursor 
element alone (R), (c) both the I and R 
elements simultaneously, and (d) the error 
indicating element (E) of a compensatory 
display. A pursuit display mode was em- 
ployed for the I, R, and I and R noise loci. 
Ihe two inputs included a simple 10 cpm 
course and a more complex 10 + 30 cpm 
target course. The noise signals had a peak 
frequency of 30 cpm with a two octave band- 


of visual 


was defined as 
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width. Integrated absolute tracking 


served as the performance metric. 


error 


The results show (a) error increased in 
proportion to the magnitude of visual noise, 
with the R condition having the least decre- 
ment and the I, I and R, and E 
having approximately the same decrement, 
which was significantly than that 
with the the over-all 
levels of tracking error were lower for the 
simple input compared to those with the more 
complex course; however, a convergence of 
the functions appeared at the higher levels 
of noise magnitude. It is concluded that S 
did not achieve a proficient filtering level and 
probably tracked the visual noise to a con- 
siderable extent. Finally, the agreement be- 
tween these data and the results of Poulton 
on intermittence of comparable display ele- 
ments (1957 principle 
that degradation of visually displayed infor- 
mation in a continuous control task will have 


conditions 


greater 
R condition, and (5) 


suggests a general 


a differential effect on performance depending 


on the locus of such degradation 
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TWO-CHOICE BEHAVIOR OF RHESUS MONKEYS IN 


A 


NONCONTINGENT SITUATION ! 


WILLIAM A. WILSON, JR. 


University of Colorado 
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A. 


y 


Several experiments (Estes & Straug- 
han, 1954; Grant, Hake, & Hornseth, 
1951; Humphreys, 1939; Jarvik, 1951) 
on human behavior in probability 
learning situations have produced 
results compatible with the so-called 
event-matching proposition, that the 
asymptotic distribution of responses 
in a two-choice situation will mirror 
the distribution of reward. In other 
studies (Bush & Mosteller, 1955, pp. 
294-296; Detambel, 1955), conflicting 
results have been obtained: Ss have 
chosen with a probability approaching 
1 that response which is rewarded on 
the higher percentage of the trials. 

Attempts have been made to ex- 
plain these different results in terms 
of different experimental procedures, 
using the distinction between con- 
tingent and noncontingent situations. 
In the contingent situation the infor- 
mation available to S after he makes 
a response depends upon which re- 
sponse was made; specifically, S can 
learn only whether or not the response 
he made on a given trial was rewarded 
on that trial. In the noncontingent 
situation, the information available 
to S after he makes a response does 
not depend upon which response was 
made; he can see whether or not the 
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other response would have been re- 
warded if it had been chosen. With 
some success it has been shown (Bush 
& Mosteller, 1955) that noncontingent 
experimental procedures were em- 
ployed in human experiments leading 
to event-matching behavior and con- 
tingent procedures in those producing 
100% selection of the more favorable 
alternative. 

Most of the animal studies (Bruns- 
wik, 1939; Stanley, 1950) previously 
reported have used a contingent pro- 
cedure. Inarecent experiment, Bush 
and Wilson (1956) designed a non- 
contingent procedure for paradise fish. 
The experimental Ss could see E place 
food in the correct goal compartment 
after each trial regardless of the choice 
Shad made. These Es compared two 
specific models of the Bush-Mosteller 
stochastic learning model. 
to an 


According 
“information model,” each trial 
increases the tendency of S to choose 
on future trials the alternative re- 
warded on that trial, but the influence 
of the placernent of reward on a given 
trial is independent of S’s choice on 
that trial; this model predicts that 
Ss will display event-matching be- 
havior at asymptote. According to a 
“secondary reinforcement model,” the 
response made on a given trial is rein- 
forced whether or not S consumes the 


reward—by primary reinforcement if 


S has made the “‘correct”’ response, 


and by (less effective) secondary rein- 
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forcement if S has made the incorrect 
response and can only see the reward. 
This model predicts that each S will, 
at asymptote, choose the more favor- 
able alternative with a probability of 
either 1 or 0, and that a higher propor- 
tion of Ss will have an asymptote of 1. 
Bush and Wilson viewed their results 
as supporting the secondary reinforce- 
ment model. 

The writers were interested in 
studying monkeys in a similar non- 
contingent experiment to determine 
whether this intermediate species 
would show behavior more similar to 
that of humans or to that of paradise 
fish. The behavior of Ss using the 
noncorrection Bush- 
Wilson led us to modify the procedure 


procedure of 
to allow for immediate self-correction. 


METHOD 


Subjects Ten rhesus 


monkeys served as Ss for the first part of the 


immature naive 


— 


4 


60 


% RESPONSES TO 75% SIDE 





experiment. One died, and the experiment 
was completed with the remaining nine. 

A pparatus.—The experiment 
ducted in a Wisconsin General Test Appa- 
ratus (WGTA). When S’s door of the 
WGTA was raised, he faced a black horizontal 
surface upon which were two black plastic 
boxes separated by a transparent plexiglass 
barrier. The boxes, which served as covers 
for shallow food wells, were hinged at the 
back, weighted, and connected so that if the 
front of either box were slightly raised both 
would fly open. The arrangement was such 
that when SS chose a box by raising it, he could 
see both food wells but could only 
immediately the one he had chosen 

Procedure days of pre- 
liminary training Ss were adapted to the 
apparatus and learned to open the food well 
covers to obtain reward. All preliminary 
trials were forced, and equal experience with 
both boxes was provided. Following this 
period the various training series began 
Each S was given 32 trials daily 
Training continued without interruption until 
completion of the experiment 


was con- 


reach 


During seven 


massed 


rhe Ss were 
approximately 22 hr. hungry and the reward 
was always a half-peanut 

Part 1.—The procedure was modelled after 


that of Bush and Wilson (1956). Subjects 
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Dotted curves indicate 


Mean performance in noncorrection situation: 


(Left side: Part 1; Right 
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five Ss originally trained to preferred side; dashed curves 
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were presented with a position discrimination 
in which the more favorable side was re- 
warded 75% of the time, the less favorable 
side 25% of the time. The position of the 
reward was randomized within blocks of 32 
trials, and the probabilities were ‘‘dependent,” 
i.e., reward was placed on only one side in a 
single trial. Subjects were not allowed to 
correct their choices, but they could see both 
food wells after each choice. For half of the 
Ss the left the favorable 
Eight days of this training were given. 

Part 2.—The procedure of Part 1 was 
repeated with the reward schedules reversed 
for each S so that 


was more side. 


the more favorable side 
on Part 1 was now less favorable, and vice 
versa. Training continued for 12 days. 

Part 3. Part 2 
were reversed, and a correction procedure was 
adopted. When S opened the unbaited box, 
he was permitted to reach around the barrier 
and obtain the peanut in the other food well 
Training was continued for 60 days under 
these conditions 


The reward schedules of 


Between Parts 3 and 4 each S was given 
an individual designed to 


change the probability of his choosing a given 


series of trials 


% RESPONSES TO 75% SIDE 
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side from the value existing at the comple- 
tion of Part 3 to approximately .50. On the 
trials of a single day, reward wes either 100% 
on one side and none on the other, or ran- 
domly distributed 50% to each side. All Ss 
were required to meet a criterion of choosing 
each side at least 13 times in a daily series 
of 32 trials when the reward schedule was a 
random 50%—50%. 

Part 4.—The training was identical with 
that of Part 3. For each S the side chosen 
more often on the later trials of Part 3 was 
made the less often baited (.25) side. 

Part 5.—The side chosen more often on the 
later trials of Part 4 was made the less favor- 
able one and the procedure 
of Part 1 was used, for a period of 12 days 


“ + 49 
noncorrection 


RESULTS 


Parts 1 and 2.—All Ss showed an 
initial preference for the left side, and 
opened the left box on more than 
half of the trials on the first day. 


When they were incorrect, Ss usually 
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Fic. 2. Mean 


performance in correction situation, 


Part 3. Broken lines indicate sub- 


groups defined on the basis of Ss’ responses on the final block of trials. 
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Mean performance in correction situation, Part 4 


Fic. 3. 


Points indicate individual scores 


on first and last blocks of trials 


looked at the reward, and on early 
trials, tried to obtain it. Despite 
their preferences, all Ss quickly began 
to go most of the time to the box that 
was rewarded 75% of the time; on the 
eighth day, all Ss went to this more 
correct box on all 32 trials. The 
left side of Fig. 1 shows the daily 
means for the total group on this 
problem, and the means for the sub- 
groups that had the left (top curve) 
or the right (bottom curve) box as the 
.75 alternative. As the right side of 
the figure shows, Ss quickly reversed 
when the reward contingencies were 
reversed, and within 12 days chose 
the more 

of the time. 


favorable box almost all 


Part 3.—In Fig. 2 the solid line 
shows the mean performance of the 
total group on the 1,920 trials of 
the first correction experiment. The 
mean proportion of choices of the .75 
side reached approximately 60%, and 
had not stabilized. The 
figure also shows the course of learn- 
ing for two groups defined a posteriori 
on the basis of their performance on 
the last block of the 
curve presents 


obviously 


trials; upper 


mean performance 
throughout this part of the experiment 
for six Ss whose individual percentages 
of choice ranged from 74.2 to 100 on 
the last block; the lower curve is for 
three Ss with final scores of 0.8%, 


1.6% and 6.2% choices of the more 
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baited side. None of the Ss 
in the latter group ever went to the 
more often baited side more than 25% 
of the time in any block of trials; 
in other words, any S that ever chose 
the .75 side more than one-quarter 
of the time continued to increase his 
choices of that side. Individual Ss 
who eventually went predominantly 
to the more often baited side initially 
responded approximately 100% of the 
time to the other side for from 3 to 
46 days. 

Part 4.—Individual Ss took from 
5 to 54 days to meet the criterion for 
approximately equal choice of the 
two boxes. The mean curve (Fig. 3) 
for choices of the .75 side during Part 
4 shows a slight increase and may 
still not be at asymptote. The points 
on the figure show the distribution 
of Ss on the first and last blocks of 
trials; the rank order correlation be- 
tween these scores is .64. All of the 
Ss with between 20% and 80% choices 
of the more often baited side on the 
first block of trials their 
choices of this side. 


often 


increased 
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Part 5.—When the noncorrection 
procedure was reinstated, all Ss went 


to the more favorable side approxi- 


mately as quickly and consistently as 
in Part 1 (see Fig. 4). 


DISCUSSION 


The attempt in this experiment to 
produce a noncontingent situation was 
successful since by the construction of 
the apparatus and by the observed 
behavior of the Ss, on each trial S could 
and did see (a) what outcome resulted 
from his response, and (b) what outcome 
would have followed the other response 
if it had been made. Despite this at- 
tempt to provide a situation likely to 
result in event-matching behavior, all 
Ss in Part 1 adopted at asymptote the 
“most rational’’ procedure of always 
responding to the more often correct 
These results differ from 
obtained in noncontingent situations 
with either paradise fish (where Ss ap- 
either 0% or 
(where an event- 


box. those 


proach asymptotes of 
100%) or humans 
matching asymptote is reached). 

The fact that all 10 Ss chose the more 
favorable side so regularly is inconsistent 
with the information model, which pre 
dicts that each S will choose this side 
75% of the time. It is strong 
an argument against those models (the 


not so 
a» = 1 model for contingent events or 
the secondary reinforcement model) which 
predict that Ss will go at asymptote to 
this side with probabilities of either 1 or 
0, and with a larger percentage of Ss 
going with probability 1; for it may be 
that our sample was not sufficiently large 
to include any of that small percentage 
that would have gone consistently to the 
less favorable side. However, the fact 
that all Ss went to the .75 side when they 
began with ps of approximately 0 (Part 
2) is further evidence against such models, 
since any S with a p of exactly 0 should 
not be affected by additional trials. 

The possibility remains that Ss were 
responding just as they would have in a 
contingent where they 


situation were 
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given no information about the outcome 
of the response not made. One possible 
interpretation of such a suggestion is 
that visual information alone is not 
equivalent to the tactual and gustatory 
information available when S actually 
retrieves and eats a peanut, so that a non- 
correction procedure for food reward can- 
not be a truly noncontingent one for the 
monkey. In an attempt to overcome 
this argument, the correction procedure 
of Part 3 was used. Once again there 
is no evidence that all Ss are going to an 
asymptote of .75. On the other hand, 
there are Ss that are responding to the 
more favorable side with ps approaching 
0, but the course of behavior exhibited 
by the majority of Ss suggests that these 
animals would have eventually reversed 
if training had been continued longer. 
Since all Ss had begun Part 3 with ps 
approaching 0, it is not surprising, par- 
ticularly considering the inefficiency of 
this training procedure, that some of 
them remained at approximately that 
level of performance throughout this 
correction situation. 

In Part 4 we tried to provide a situa- 
tion in wh‘ch any S's choice of the less 
correct side on a preponderance of the 
trials would be an impressive argument 
for a secondary reinforcement model, 
but were only partially successful in 
meeting the condition of having all Ss 
begin with a p of 0.5. Granted the 
limitations of our test, there was no S 
who began this training with a p ap- 
proximately .5 and ended it with a 
smaller value. The group curve is still 
rising at the end of the series, 
suggests that 
tinuing to increase their proportions of 


which 


individual Ss are con- 


choice of the .75 side. The asymptotic 
mean performance of the monkeys can- 


not be predicted with certainty, but once 


again it is possible that all Ss would have 


reached levels of 
100% 


baited side. 


eventually approxi 


mately choice of the more often 


We cannot be certain that, in our Ss, 
level of motivation, attitude, or other 


factors that have been said to affect 


behavior in these probability learning 
problems are comparable to those of 
humans in formally similar situations; 
we can say that one sustained attempt 
to get event-matching behavior from 
rhesus monkeys has not been able to 
produce it. 


SUMMARY 


Ten monkeys were trained in a probability 
learning situation, one response being re- 
warded on 75% of the trials, and the alterna- 
tive response on the remaining 25°% of the 
trials. After each response, S also could see 
whether the response he did not make would 
have been rewarded if it had been made 

When correction of an unrewarded re- 
sponse was not permitted, all Ss quickly 
reached an asymptote of approximately 100% 
choice of the .75 side, regardless of their 
initial probabilities of choice of that more 
favorable side. 

When immediate correction was allowed, 
most Ss slowly increased their proportion of 
choices of the .75 side. The data do not 
support the conclusion that all Ss were ap- 
proaching an asymptote of 75% choice of the 
.75 side; they are consistent with the sugges- 
tion that with sufficient testing all would have 
chosen the .75 side approximately 100% of 
the time. 

The behavior of monkeys in this situation 
is not compatible with either an “informa- 
tion” or a “secondary reinforcement” model. 
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Since the Zeigarnik (1927) effect 
was first discovered, the effects of 
interruption on recall have been in- 
vestigated many times in attempts 
to refine the determining conditions. 
[wo groups of investigations 
are relevant the present experi- 
ment, those attempting to find a 
possible physiological basis for the 
differential recall (Freeman, 1930; 
Smith, 1953), and those manipulating 
the attitude of Ss to the interruption 
(e.g., Glixman, 1949; Lewis & Frank- 
lin, 1944; Rosenzweig, 1943). 

It has been suggested that the state 
of the bodily musculature constitutes 
a possible physiological explanation 
of the Zeigarnik effect. Smith (1953) 
has shown that muscular tension in 
the active arm and in the speech 
muscles increased progressively up 
to the moment of completion or 
interruption of a simple mirror-draw- 
ing task. In the former case, tension 
then dropped sharply, whereas, when 
interruption occurred, it remained 
high for a short time. Thus it was 
possible to discriminate between an 
interrupted and a completed task at a 
muscular level, although in this experi- 
ment Smith made attempt 


these 
to 


no to 


investigate subsequent recall. 
Smith hypothesized that this con- 


muscular activity after 
the self-main- 


tinuance of 
interruption reflects 
taining action of a ‘‘phase sequence”’ 
(Hebb, 1949), which by reason of its 
incomplete organization is likely to 
dominate later recall. Finished tasks, 
on the other hand, will have corre- 
spondingly better-organized phase se- 
quences which will have a greater 


TENSION 
PTED 


IN 
TASKS 


THE RECALL 
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and 
into 


probability of “‘short-circuiting,”’ 
so will be less likely to intrude 
later remembering. 

However, this formulation appears 
to ignore those experiments in which 
the attitude of S the task 
is experimentally determined. The 
results of such experiments (Glixman, 
1949; Rosenzweig, 1943) generally 
show that when S is ego-oriented and 
interprets the interruption as failure, 
there is poorer recall of the inter- 
rupted J tasks, i.e., a reversal of the 
normal Zeigarnik effect where the J 
tasks are recalled more readily. If 
it can be shown that under these 
conditions also, an increase in muscu- 


towards 


lar tension accompanies interruption, 
then it automatically follows that it 
is wrong to attribute any mnemonic 
significance to the higher tension. 
The present experiment was designed 
to check this possibility. 


METHOD 
The 


graduates between 
studying either sociology or mathematics, 
none of whom had heard of the Zeigarnik 
effect or suspected the purpose of the experi- 
ment. All Ss were right-handed. 

A pparatus.—Eight diagrams were 
a mirror-drawing task. The diagrams were 
the stenciled outlines of simple objects (a 
house, axe, car, spade, pencil, lamp, windmill, 
and ship) with additional details filled in with 
colored ink to ensure they could be easily 
recognized. The outlines measured 30 cm 
in length in the case of the tasks which S was 
to be allowed to complete, and 35 cm. in those 
tasks which were interrupted. Inter- 
ruption always occurred as near as possible 
to 5 cm. from the end of the task, so that 
approximately the same amount of work 
was performed in all cases. 


40 women under- 
of 18 and 21 


Ss were 
the 


Subjects 


ages 


used in 


to be 
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Ihe electromyograph was of private design 
(Forrest, 1956) and was used in conjunction 
with an integrator unit and a Kelvin-Hughes 
high-speed pen recorder which recorded on 
Teledeltos paper. A built-in calibrator was 
used to check the amplification before and 
after each set of tasks. 

Bipolar electrodes of the sponge type 
soaked in saline were attached to the dorsal 
surface of the active arm half-way between 
the lateral epicondyle and the styloid process 
of the ulna (J. F. Davis, 1953). An earthing 
plate was taken from the upper arm. 

Procedure.—All Ss were told beforehand 
that they would be taking part in an investi- 
gation into muscular tension during drawing. 
They were assigned at random either to the 
ego- or task-oriented group. Instructions 
then differed for the two groups. The mem- 
bers of the task-oriented group were each 
told: 

“You are a lucky subject as you're acting 
as a control for the other experimental group. 
We're not interested so much in how well 
you draw—in fact we're not interested in 
your ability at all—but rather in how good 
the material is for the task. We're going to 
record muscle potentials for this purpose. 
You may be stopped before you finish a task 
if we have got enough information about that 
task from what you've done.” 

Individuals in the ego-oriented group were 
read the following instructions: 

“The purpose of this experiment is to 
measure your ability at a skilled task. Your 
performance will be compared with that of 
other members of the class and then we will 
show how it is associated with intelligence 
If you should do very badly at a task you 
will be stopped before the end is reached. 
50 «interruption failure. Is that 
clear?” 

Both groups were then told: 

“The task consists of a series of simple out- 
line figures which have to be traced within a 
certain time limit while looking in a mirror 
It is necessary for you to start and stop at 
the very instant that the command ‘Start’ or 
‘Stop’ is given. Keep your pencil on the 
paper at all times and do not take it off 
when the command ‘Stop’ is given. Please 
keep it at the point where you stopped for a 
short period of time until you are asked to 
remove it. Are there any questions? ”’ 

All Ss were reassured that there was no 
possibility of an electric shock. Then the 
electrodes were attached and a trial recording 
made while’S practiced on a star pattern. 
The S was then given the first task of the 
series of eight. A rest period preceded the 


signifies 
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actual drawing. During this rest period, S 
was asked to hold her pencil at the beginning 
of the diagram and to plan out her future 
movements around the diagram. Continu- 
ous recordings were made from the beginning 
of the rest period until 5 sec. after interruption 
or completion. Interruption took place by 
E saying quietly, “Stop. Keep your pencil 
still.” This was also said on those trials 
when S was allowed to complete the diagram, 
and in both cases a marker pen was activated 
on a second trace of the pen recorder at the 
moment of speaking. 

Each S was interrupted on four of the tasks 
and allowed to complete the remaining four, 
both the order of presentation and the choice 
of tasks for interruption or completion being 
determined at random. 

As soon as the drawing procedure was 
over, S was given a cancellation test at which 
she worked for the next 10 min. Most Ss 
appeared to believe that muscular tension 
levels were still being recorded during this 
task. 

The S was then asked to recall in any order 
she wished the objects she had traced. The 
time takén to recall each object was recorded 
and the order of recall noted. All Ss re- 
ported that this came as a surprise, and none 
of them had tried to remember the objects 
when they were drawing them. 

Quantification.—In view of Smith's (1953) 
finding that integrated action potentials over 
a period of 1 min. after interruption or com- 
pletion were similar, but that averages over 
2-sec. periods of time discriminated highly, 
it was decided to quantify in a similar way to 
his, i.e., by measuring spike deflections rather 
than integrator levels. 

The number of spikes occurring in each 
period of 1 sec. after the moment of stopping 
were counted and 10 spikes chosen at random 
from each of these periods. These 10 spikes 
were measured to the nearest .5 mm. and 
these measurements averaged and converted 
into microvolts 


RESULTS AND DISCUSSION 


It is, first, essential to know whether 


the instructions were effective in 
bringing about a difference in the 
relative ease with which J tasks were 
recalled. 

For each § the number of recalled 
I tasks minus the number of recalled 
C tasks was calculated, these were 
then ranked and a U test applied to 
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compare the two groups 
under different conditions. A U value 
of 90 was obtained with nm; = n. = 20, 
which gives a significance level of 
<.01. The task-oriented Ss showed 
the classical Zeigarnik effect and 
remembered more interrupted tasks, 
whereas the ego-oriented instructions 
were effective in causing a reversal 
of this result with the other group. 

Now if the augmented tension 
found by Smith is the basis of im- 
proved recall, a difference in tension 
should be found between J and C 
tasks for the two groups. In the 
case of the task-oriented Ss, we should 
expect a higher average tension level 
to accompany the interruption rather 
than the completion of a task (as 
more members of this group recalled 
more interrupted than completed 
tasks), while in the ego-oriented 
group, the situation would have to be 
reversed. 

Table 1 presents the mean tension 
levels for both groups over each 
second of the 5-sec. rest period that 
followed interruption or completion. 

In the case of the task-oriented 
group, the interrupted tasks were 
accompanied by a higher tension level 
than the completed tasks over the 
first second of the rest period, and 
the probability of such a difference 
arising by chance is <.05, using a / 
test. This was to be expected, as it 
is in line with Smith’s results. Thus, 


working 


rABLE 1 
MEAN TENSION LEVELS (xv.) OveR EACH 


SECOND OF Rest PERIOD 


Seconds After Stop Signal 


Group 


lask- 
oriented 

Ego- 
oriented 
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for this group, higher muscular ten- 
sion is associated with better recall. 

However, the ego-oriented group 
also showed an effect in the same 
direction and with a significance level 
of <.01. Yet this group remembered 
the completed tasks more easily, that 
is, those accompanied by less muscular 
tension. 

It seems impossible to maintain the 
thesis that a high level of tension in 
the active arm is a necessary basis for 
improved recall. This is not to say 
that other loci of tension do not exist 
elsewhere in the musculature which 
may discriminate between remem- 
bered and forgotten tasks. But in the 
present experiment, higher tension 
levels appear to be the result of the 
physical act of interruption. 

A comparison of the average muscu- 
lar tension exerted by the two groups 
of Ss revealed no significant difference 
between them. Thus the fact that 
one group was given more formal 
instructions and was led to regard 
interruption as failure was not re- 
flected in their muscular 
records. 


tension 


When the course of muscular ten- 
over the after the stop 
signal is examined, it can be seen that 
there is a considerable drop in tension 
over this period of time, and that the 
differences between J and C tasks 
are most marked for both groups at 
the beginning of the interval. Over 
the second, third, fourth, and fifth 
seconds the differences between J and 
C tasks are not significant for either 
group. It probable that if 
recording had been continued, this 
state of affairs would not have altered, 
as residual tension would have reached 
a true resting level. It 


sion 5 sec. 


seems 


is therefore 
unlikely that such a momentary phe- 
nomenon could have served as an 
adequate basis for a mnemonic process 


which was to last at least 10 min. 
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This theoretical difficulty is admitted 
by Smith, who likewise found a signifi- 
cant difference between J and C tasks 
in the 2-sec. periods immediately follow- 
ing the drawing, but not when longer 
postperiods were compared. 

This is not a decisive drawback for 
any theory that utilizes the phase se- 
quence type of explanation, as the neural 
activity presumably need not be reflected 
Although 
the performance of a drawing task may 
be “organized"’ in the sense that tension 
progressively increases up to a maximum 
at the moment of stopping and then 
decreases the next few 
such changes may be specific to this type 


in peripheral tension change. 


over seconds, 
of task; they are not necessarily found 
in nonmotor tasks (e.g., R. C. Davis, 1939; 
Forrest, 1956; Geldreich, 1953). There 


thus seems no good reason for supposing 
that the high muscular tension recorded 
after the interruption of a motor task 


is anything more than a motor phenom- 
enon due to the sudden prevention of the 
continuance of a skilled movement. 


SUMMARY 


This experiment was designed to discover 
whether the high level of muscular tension 
which has been shown to occur after the inter- 
ruption of a motor task is a possible somatic 
basis for the Zeigarnik effect. To this end 
two groups of Ss were employed, one task- 
oriented, the other ego-oriented. An EMG 
record was taken from the active arm dur- 
ing the performance of eight mirror-draw- 
ing tasks, half of which were interrupted. 

As the ego-oriented group showed the 
expected reversal of the Zeigarnik effect, and 
as in both that more 
muscular tension occurred after interruption 


groups it was found 
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than after completion, it was concluded that 
the increased tension was not a 
concomitant of enhanced recall. 


necessary 
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REACTION TIME AS A FUNCTION OF FOREPERIOD 
DURATION AND VARIABILITY! 


LAWRENCE 


KARLIN 


New York University 


In an early experiment, Woodrow 
(1914) studied the relationship be- 
tween simple RT and foreperiod, i.e., 
the time intervening between warning 
signal and reaction stimulus. When 
foreperiods were constant for each 
block of trials (so-called “regular 
preparation’), he found that the 2- 
sec. foreperiod was optimal in that it 
yielded the shortest RT as compared 
to RTs obtained with 
ranging from 1 to 24 sec. 

Because subsequent experiments 
(Huston, Shakow, & Riggs, 1937; 
Klemmer, 1956; Rodnick & Shakow, 
1940; Teichner, 1954) have not agreed 
in confirming Woodrow’s results, the 
present study was concerned with a 
more detailed determination of the 


fc re] eriods 


shape of the functional relationship 
between RT and foreperiod in the 
region of Woodrow’s optimum. 


A second purpose of the present 
study was to determine the way in 
which preparation to respond as in- 
dexed by RT develops during short 
foreperiods of varying length. It was 
hoped that the results would provide 
information concerning whether there 
were characteristic readiness curves 
such as those hypothesized by Wood- 
worth (1938) for relatively 
foreperiods. 


long 


METHOD 


The present study consists of three experi- 
ments. Except where noted the procedure 
is the same for all three. Experiment II was 
performed to introduce some methodological 
refinements and to provide additional data 


1 This research was supported by a grant 
from the New York Arts 
Science Research Fund 


University and 


to support the more detailed analysis required 
by the findings of Exp. |. Experiment III 
was performed as a further check on the 
discrepancy between the results of Exp. | 
and those obtained by Woodrow. 

Subjects.—The Ss were 31 experimentally 
naive college students, eight each in Exp. | 
and II, and 15 in Exp. III 

A pparatus and stimulus conditions 4 240 
cycle tone of 33 db intensity (re .0002 micro 
bar) and duration of .1 sec. both the 
warning signal and reaction stimulus. In 
Exp. II only, the duration of this tone was 
shortened to 
increased to 49 db. 


was 


03 sec., and its intensity was 
the de 
creased duration was to minimize the possible 
ambiguity that may have occurred in reacting 
either to the onset or cessation of the longer 
tone of Exp. I and III 
creased because shortening the tone this much 
made it more difficult to hear 
electronic 
the sequence, 


The reason for 


Intensity was in- 
Three decade 
timers were connected to provide 
ABAC. Timer A controlled 
stimulus duration, Timer C, the rest period, 
constant at 7 Timer B regulated 
Each setting of Timer 
B was programmed by 
prov ide “slightly” 
needed 


secs... and 
foreperiod duration 
a stepping relay to 
varying foreperiods when 
The reaction movement was the up- 
ward release of a spring-loaded reaction key 
RT was measured by a Springfield Timer 
calibrated in milliseconds. The S sat in one 
room, wearing a pair of headphones. In this 
room, which was adjacent to the room con- 
taining the apparatus, no equipment noises 
could be heard by S. 

Procedure.—The following description ap- 
plies only to Exp. land Il. The procedure of 
Exp. III will be given later 
discussion of its results 

Each S in Exp. I and II attended 50-min 


sessions on 


along with a 


five consecutive days at about 
the same time each day 
mental condition, C or V was used in each 
For four Ss the order was CVCVC, 
for the other four Ss it VCVCV. In 
Cond. C, the foreperiods were .5, 1, 2, 
3.5 secs., and each one was repeated in two 
blocks of 21 thus 
42 trials for each foreperiod, yielding a total 


of 168 RTs for the session and a grand total 


Only one experi 


session. 
was 
and 


trials each There were 
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of 840 RTs per S. Foreperiods were ar- 
ranged in ascending-descending (AD) ort 
descending-ascending (DA) order. The four 
Ss in each group described above were further 
subdivided into two groups who received 
either the DA or AD order. In Cond. V, the 
21 trials in each block were divided equally 
among three foreperiods, with the median 
foreperiod corresponding to one of the fore- 
periods of Cond. C, while the other two fore- 
periods were 20% above and 20% below this 
median, respectively. The first trial of each 
block was always the median foreperiod; the 
remaining 20 trials consisted of the three 
foreperiods arranged so that each one followed 
itself and the other two about an equal num- 
ber of times. The foreperiod durations were 
A, .S, .6; .8, 1.0, 1.2; 1.6, 2.0, 2.4; 2.8, 3.5, 4.2 
set 

Condition V had two purposes: (a) to 
guard against attempts to synchronize reac- 
tions to foreperiod duration (Woodworth & 
Schlosberg, 1954) and (6) to provide a basis 
for the construction of the “readiness” curves 
previously referred to. 

In both conditions, the first trial of each 
block was not used. After S was seated, he 
was given instructions which emphasized that 
he avoid premature reactions and have his 
two fingers (second and index) on the reaction 
key so that the contacts would be closed 
prior to the warning signal. When any pre- 
mature reaction was made, S was so informed 
by an oscillating and the was 
repeated. 


tone trial 


RESULTS AND DISCUSSION 


The first day’s data were omitted 
from the results of Exp. I and !I in 
order to equate the number of sessions 
per condition for each S, as well as 
to minimize practice effects. 

The optimal foreperiod.—Table 1 
summarizes the results of Exp. I and 
Il on the relationship between RT 
and foreperiod. These results are 
also plotted in Fig. 1 where the RT- 
foreperiod function for Cond. C is 
shown 'by the broken line. The com- 
parable function for Cond. V may be 
studied by considering RT for the 
median foreperiod in each of the four 
groups of foreperiods which have been 
connected by solid lines. 
tions in 


The func- 


both conditions show that 
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TABLE 1 


RT as A FUNCTION OF FOREPERIOD 
DURATION AND VARIABILITY. 
EXPERIMENTs I AND II 


Duration of Median Foreperiod 


Deviation nc.) 


from Me 
dian Fore 
periods (%) 


Cond. 


Mean 


RT rises from .5 to 3.5 sec. without 
inflection. Another way of describ- 
ing the RT-foreperiod function in 
Cond. V is to average RTs for the 
three foreperiods in each set and plot 
the average RT as a function of 
median foreperiod. These averages 
are not plotted but are given in Table 
1 where inspection shows that they 
follow a similar trend to that depicted 
in Fig. 1. Since no inflection 
obtained, Woodrow’s finding of a 
2-sec. optimum is not confirmed. 

There is also no evidence that the 
results of Cond. C were appreciably 
affected by prematurely originated 
reactions such as might have been 
expected if S were attempting to 
synchronize his reactions to foreperiod 
duration. Not only are the two func- 
tions similar in shape, but the actual 
number of premature reactions is very 
small and does not differ appreciably 
in either condition. 


was 
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Reaction time as a function of foreperiod duration in Exp. I and II. 
N = 8 in each experiment. 


As a further check on the dis- 
crepancy between the present results 
and those of Woodrow, the possibility 
was investigated that “warm-up” 
effects in Woodrow’s experiment pro- 
duced the superiority of the 2-sec. 
over the 1-sec. foreperiod since he 
used only the ascending order of 
foreperiods in each session. Wood- 
row (1914, p. 54) checked the possi- 
bility of unbalanced practice and 
fatigue effects but did not feel that 
it was necessary for his purposes to 
counterbalance them. He seems not 
to have considered the specific possi- 
bility of warm-up effects. Accord- 
ingly, in Exp. III we tested 15 un- 
practiced Ss for only one session 
using the procedure of Cond. C except 
that the .5-sec. foreperiod was omitted 
and only the AD sequence was used. 


As shown in Table 2, the results of 
Exp. I and II are confirmed, and no 
inflection in the ascending series 
which might be due to a “warm-up” 
effect is indicated. However, it is 
worth noting that six Ss did yield 
2-sec. inflections in the 
order as compared to one in the 
descending order. Since Woodrow 
only used three Ss, it may be that 


ascending 


rABLE 2 


RT as A FUNCTION OF tE PERIOD 
Duration: Exp. III 
(N = 15) 


Foreperiod Duration (Sec.) 


Ascending 
Descending 
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sampling factors interacted with his 
conditions to produce either a 2-sec. 
optimum or a nonsignificant difference 
between the 1- and 2-sec. foreperiods. 

Foreperiod variability —W oodrow 
(1914, pp. 28, 29, 35f.) maintained 
that the regular increase in RT for 
foreperiods of 2 sec. duration and 
longer was a result of the decrease 
in the absolute accuracy of time 
estimation. In fact the shape of this 
function suggested the operation of 
Weber’s Law. On the basis of his 
argument, the foreperiod variabilities 
used in Cond. V were varied in direct 
proportion to the median foreperiods. 
Even if Woodrow’s assumption con- 
cerning Weber’s Law were only ap- 
proximately true, it was felt that 
constant relative rather than constant 
absolute variability in Cond. V would 
introduce less change in the shape 
of this function when compared to the 
corresponding function of Cond. C. 
The validity of this rationale is sup- 
ported by the fact that the functions 
for Cond. V and C in Fig. 1 show no 
tendency to diverge, even though the 
range of foreperiod variability in 
Cond. V increases from .2 to 1.4 sec. 


Readiness and the effect of preceding 


foreperiods.—In setting up this study 
it was hoped that Cond. V would 
provide some information on how S 
prepares to react within each block 
of foreperiods since the slightly vary- 
ing foreperiods within each such block 
might catch S at different stages of 
preparation. Thus the curves of 
Cond. V shown in Fig. 1, might be 
interpreted as empirical readiness 
curves (Woodworth, 1938) showing 
the way in which S’s readiness de- 
velops during a given block of fore- 
periods. Following this interpreta- 
tion, the curves for the 1-, 2-, and 
3.5-sec. blocks of foreperiods may 
mean that S is least ready (highest 


RT) when the reaction § stimulus 
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TABLE : 
F Ratios DERIVED FROM VARIANCE ANALY- 
SES OF RT VARIATION DUE TO VARYING 
FOREPERIOD WITHIN Eacu BLock 


Median Foreperiod Duration (Sec.) 


F os based on 2 and 14 df for greater and lesser mez 
squares, respectively, is 3.7. 


occurs at the end of a short foreperiod, 
but increases his readiness with the 
additional time provided by the me- 
dian foreperiod. Beyond the median 
foreperiod, readiness, if it does not 
actually increase, is at least maintained 
above that of the short foreperiod. 
This description clearly does not 
apply to the curves of the .5-sec. 
block which in both experiments show 
a reverse tendency. 

The results of a separate variance 
analysis of each trend shown in Table 
3 suggest that the trends for the 
three longer foreperiods are real, since 
five out of six Fs are significant at 
much better than the .05 level. On 
the other hand, Fs for the .5-sec. 
block are not significant at this level. 
In any case, we can conclude that the 
.5-sec. block of foreperiods yields 
results that differ from those obtained 
with the longer foreperiods, but we 
cannot say at this point that the 
variation within the .5-second block 
is systematic. 


In interpreting the curves of Fig. 1 
as readiness curves, we have neglected 
the fact that each of these curves is not 
based on conditions designed to promote 
preparation for a fixed foreperiod. In 
other words, the curves may not really 
describe the development of a uniform 
set or state of readiness because there 
is no single definable set characteristic 
of the three foreperiods that make up a 
given block. For example, both Klem- 
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mer (1956) and Woodrow (1914) have 
noted that RT is not only influenced by 
the length of a given foreperiod but also 
by the length of the immediately pre- 
ceding foreperiod. Specifically, a short 
foreperiod preceded by a long one may 
catch S “napping,’’ while a long fore- 
period preceded by a short one will not 
have an adverse effect since S, influenced 
by the short foreperiod, gets ready more 
quickly and tends to maintain the readi 
ness so developed. 

It seemed possible that the curves for 
Cond. V in Fig. 1 could be explained 
partly in terms of these effects. For 
example, the consistently longer RTs 
obtained with the shortest foreperiods 
in the 1-, 2-, and 3.5-sec. blocks might be 
due, not to the length of foreperiod per se 
but to the fact that the shortest fore- 
period would be preceded more often by 
longer foreperiods. On the other hand, 
this prediction could not be made for 
the .5-sec. block of foreperiods since the 
shortest foreperiod in this block did not 
yield the longest RT. 

To explore the above suggestions, Fig. 
2 plots readiness curves comparable to 
those in Fig. 1, that it tries to 
make explicit the effects of the imme- 
diately Each of 
these so-called readiness curves is based 


except 
preceding foreperiod. 


only on those foreperiods which have 
been immediately preceded by 
periods of a given magnitude 
ample, the L (solid line) show 
RTs only for those short, medium, and 
long foreperiods within each block that 
have been preceded by a long foreperiod. 


fore- 
For ex- 
curves 


The results shown in Fig. 2 support 
the suggestion that each of the curves 
for Cond. V in Fig. 
different readiness curves. Except for 
the .5-sec. block of this 
interpretation is indicated by the uniform 


1 are composites of 


foreperiods, 


tendency of the curves for long preceding 
foreperiods to drop quite sharply in RT 
within each of the three longer blocks, 


as contrasted with the absence of any 
significant trend for those for short pre- 
ceding foreperiods in the same blocks. 
The absence of any trend in the latter 
fact that the 


preceding foreperiods 


case may be due to the 


curve for short 
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reaches a peak of readiness earlier than 
for long preceding foreperiods, so that 
readiness is not changing any more but 
is merely being maintained. 
Examination of the curves obtained 
for the .5-sec. block of foreperiods sug 
gests that the way in which preparation 
develops is also influenced by the effects 
of the preceding foreperiod. However, 
the trends differ radically from those 
obtained for the three longer foreperiods. 
Thus, the curves for long preceding fore 
periods no longer display any significant 
trend and, in fact, show a slight tend- 
ency to vary in the opposite direction. 
In contrast, the curves for short preced 
ing foreperiods are very significant statis 
tically, showing a marked tendency to 
increase in RT beyond the median fore 
period. These results suggest that the 
patterns of preparation for this block 
may be different from 
those of the longer foreperiods in two 
respects: 


of fe yreperic ds 


(a) a tendency to get ready as 
fast as possible which is superimposed 
on specific tendencies resulting from dif 
ferent lengths of preceding foreperiod, 
and (bd) less ability to maintain a peak of 
readiness. The first conclusion is based 
on the fact that the curve for long fore 
periods does not show an initially higher 
RT which characterized the curves of the 
three longer blocks of long foreperiods 
in both experiments. In effect, this 
means that in the block of fore 
periods, S is not caught “napping’’ when 
The 
mainly on 
the sharp increase in RT displayed by 
the curve for rhis 
result suggests that S is unable to main 


5-sec. 
a short period follows a long one. 


second conclusion is based 


short foreperiods 
tain the peak of readiness shown by the 
curve for short foreperiods even for so 
short a time as an additional .1 sec 
While these interpretations are based 
on the results of both experiments, it is 
worth noting that the trends shown in 


Fig. 2 are much sharper in Exp. II 


SUMMARY 


R | and 
period was studied under two conditions of 
foreperiod variability 


rhe relationship between fore 


Under one condition 
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Reaction time as a function of foreperiod duration in Exp. | and I] showing the 
effects of various lengths of preceding foreperiod. 





REACTION TIME 


there was minimal objective variability, under 
the other there was 20° objective variability. 
The latter condition was used in an attempt 


to reduce the influence of prematurely origi- 
nated reactions, and provide a method for 
studying the development of’ preparation. 
The results obtained suggested the following: 


1. There is no evidence that prematurely 
originated reactions had an appreciable influ- 
ence on either condition. 

2. Woodrow’s finding of a 2-sec 
was not confirmed. 


optimum 


3. The steady increase in RT as a function 
of foreperiod contradicts a previous conclusion 
(Teichner, 1954) that the optimum lies within 
a range of from 2 to 8 sex 

4. RT following a given foreperiod was 
influenced by the magnitude of the foreperiod 
immediately preceding the given foreperiod. 

5. The development of a state of readiness 
followed a similar course during the three 
longer blocks of foreperiods which was dif- 
ferent from the way in which it developed 
during the shortest block of foreperiods 
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